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(54) Title : DNA CHAIN USEFUL FOR XANTHOPHYLL SYNTHESIS AND PROCESS FOR PRODUCING 
XANTHOPHYLLS 

(57) Abstract 

The following DNA chains relate to xanthophylls having a keto group, represented by astaxanthin, and the 
following technique relates to a genetically engineered production of xanthophylls: a DNA chain having a base 
sequence coding for a polypeptide having an enzymatic activity of converting the 4-methylene group of a 0-ionone 
ring into a keto group; a DNA chain having a base sequence coding for a polypeptide having an enzymatic activity 
of converting the 4-methylene group of a 3-hydroxy-£-ionone ring into a keto group; a DNA chain having a base 
sequence coding for a polypeptide having an enzymatic activity of adding a hydroxyl group to the 3-carbon atom 
of a 4-keto-p-ionone ring; and a process for producing various xanthophylls, such as canthaxanthin and astaxanthin, 
^mtroducing the above DNA chain(s) into a suitable microorganism, e.g., Escherichia coli. followed by expression 
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% 2 0 HP (i n Agrobacte riam a u r a n t 1 a c u s s 

p. nov. If K 1 «fc XS- Alcal igenes sp. PC-1 O v >f - 
tt££fifc&S&££tr*U-> h 7 ^ ^4^«gK i + t > 

* % « > isiitin^D/^f'jn Agrob 

acteriam auranti acus sp. nov. MK1 fe J; Cf Alcaligene 
s_sp. PC-1 \z&*tZ7X?*-*yl->W®>rhm : $: 
ttftt^7^( yh^n^y-f K ) © £ lc W ffi 
ttDNA IJU & tf> d © DNA H^^A I Izfc&mZMR I 

yh^D^y-T K) > * fc 5 N 77Ht>f .7 
x-3 4 l 1f>f->> 4 - Hr'7 + t>f A 

^ 5 o 

* » 08 C «fc * » H A i»> 7 r ^ = ii )l i& K V> 
& & £C S % Jig MO W tc tt N Iff IH ( 1 ) , ( 1 0 ) 4s £ & 

( 1 9 ) \z «k 0 * $ ft . g * W \z It , if E ( 2 ) , ( 4 )' , 
(11), (13). (20)*«fcCf(22)fcJ:e)3i8*ns 
DNA m (2 ) «fc (4 ) © * # 05 *< I& IB 
(6 ) *> <fc (8 ) T <9 > DNA £g (11) *> J; (13) 
© Hf*0J j&<«J!E (15) * <fc (17) $ to (C> DN 

A £ (20) *s «fc Xf (22) ©Ifti^iE (24) «fc 
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( 2 6) T h % o t£ £ , DNA m ( 3 ) . (5 ) , (7 ) , 

(3 ) • (12) , (14) . (16) , (18) . (21) . 

( 2 3 ) , ( 2 5 ) «fc & (27) (i , * ft ? ft N DNA g (2 ) 
. (< > ■ (6 ) . (« ) - (ID . (13) , (15) , 

(H), (20). (22), (20 £<ktf(26)K#LT^ 
X h U > ? x > Mi^ftTTA^ij^^xtUOT! 
& 5 0 

5 y K E ?'J # £ S ft K E ?'J # -t 1 ~ 2 X cf 5 - 6 ( % 1 
3-150) titt SHE Lfc J: ?41} 
5£ $G ffi ( £ * if E ?IJ # 1 (I1H) T I* 7 $ y g£ # 
*t 1 ~ 2 1 2 © E ?iJ T * 5 7 $ y £ £ *| ( I 1 0 f li I 
~ B) ) *f * % m % \,\ T > Z ft t. © 

DNA SlUotn- ^n5 4 1fi)!)!H7f F ( f tt 

J; 5 ttS«fitt**t«RO 7 5 y K © ^ < o 
o ^ T £ £ „ g & N ft JD f 0| {t o T U ^ (H 
Jfe m 1 3 # Ji ) o r7 5yREJtl*<SIKW 

--- J i^i :ii#J6LTU«, A: i *. tf N COS 
IOI1IB07 ; ;S (Met ) i^^LtUUCti 
^ U O 7 ; y 1? E 3*J © $ fb £ «k 5 # U ^ f * K 4' W L 

«»*cia^*ftSo'<cfe^ & # u ^ 7 Kin- Kt 

* #IMJDNA HI*. E?'J## 1 ~ 2 5 ~ 6 (Hi- 
20StF$l 3-150) £*lfctt«KB®ttSE*l 
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DNA II (1 ) ~ (18) tt^HIA»l*3-Ft« 
it a ¥ ( cjjj i * € ) COAaWttWtt^ ft 

?| j § Agrobacterium aorantiacus sp. nov. MK1 $. tz 
it Al ca 1 i genes sp. PC-1 J; H n - ; > n fc c r t W 

HBOA*»6B (E^iJ#-tl©7W 
Iff 1-2 1 2) 4fctt»l 3-1 4H©A*6B 
(E^iJSf 5 ©7 ; ^ iff 1 ~ 2 4 2 ) * T* <D 7 i J 

ft £ DNA c r tW it ? Itt ( JK T CrtW t k 

Pf iO {* ^ 0 - -f*y>^ C i8 - i o n o n e ring) ©4tt® 
>^L/> (methylene ) 1> (keto) £ £ & S 

# a ^ > C >S -carotene ) HIi Lti^y > (ec 
hinenone) 4gT* > (canthaiantnin ) 

^Ht5ilgttt*5 ( ft 1 1 H # JR ) o 
c r tW ig fif iftl^ 3 - b K d * •> - 0 - >f * y > ^ 
C 3-hydroiy-y3 - ionone ring) O 4 fi© ^ f 1/ > ( m e t h 
y I ene ) IHt« (keto) SlCfig|**B&ffittt>Wl 
T45«9> ^©Ift««f!©lo^, •*? 7 * If > * > (:e 
aianthin) ill i LT4 - ^M7 + t>f> C 4 - k e 
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t o 2 e a i a n t h i n) ^ItTX^f >f > (astaianthin ) 
^StSSiiStT^ (mi lBtft) o & % c 

— K <5 D N A f£*»6*lT^tt^ofct>O7*0% 
C © # 'J ^ 7 f. K t fz it C ft £ a - K * <5 DNA fi tt x Si 

- * y > ^ 3 - t k n * •> - ys - j * j ymtm * 

x Agrobacteriom £ A I c a I i g e n e s Fh1 © C r t W © * * 
o s; - \t N 75/ilJiMA? N 8 3 % © 7 x > f 4 

— # > # 36 £ fiR ID B E rw i n i a © # o f- y K £ jffc it 

£ x E n i n i a © c r t E N cjJJU c r t U c r t Y it ^ fi % ^ 
BB«®ft£*H;:l- *n*>£gtg£4s. N E rwi n i a 
g> c r t E ^ c r t Ik c r t U c r t U c r t Z itfe^(i s ;*; H ® l£ 

«fc lu 12 © W0 9 1 / 1 30T8^f & §1 & fg # J& ) 0 LU«,T, 
CrtW O S I 1^ U b E r ff i n i a © c r t & ■? 1$ {I J; »j 
i^$n5®f> ±12© Erwi ni a © c r t jg £?5i|(j 
*BI§oft4ftc $ 6 1: cnf g £?^IAt2 <!:> 
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7 * V > * y m&$ktk®}T' it , 4 - >r 

■5 K ft Z o 

frggg A&fStitfc? (crtZ) 

DNA 0 (19) ~ (27) li*iI#AgI^3 - Kt5 
it ^ ( c_r_t_Z £ift£) T*5, COli^ifl^ & # 
Sfl II Agrobacteriam aorantiacas sp. dot, MK1 £ fz li. 

Al ca 1 i genes sp. PC-1 £*)9u-->yifc c r t Z ag £ 
^ T & D . »2H©Cfr&-D ( E ?'J #"# 2 87 ; y IS 
fl-162) *fcttS&15EI©Cfr&D ( E ?ij g ^ 6 
CTWilf 1-162) *?07 WIIH**t 
S # U 7 * K £ = - K "T S & S E 6 S DNA it 

* -5 o c_r_Ll &B=?m® ( JK T CrtZ l*> 0 
- (/S-i-onone ring) ©3ft©£c^Jwlo© 

•+S**ffitt**lT:fe»>^ * © H # W tt 
M © 1 o /8 - t) o f- > C iS -carotene ) ^IIH 
T & - 9 V •? h * V > * y C >S -cryptoxanthin) igt 
-tf 7 * > * > (ieaianthin) *^)St SSigft-eife 

* (HI 1H#I) o $ & fc x crtZ ig fifl^li, 4 - 
y h - yS - 4 * ^ > 58 (4-keto- jS - i o n o n e ring) ©3 
ffi©K*fclo©#»«*#to*S**ffitt4>WLT:fc 
Ox f ©I#65^i!)lofr\ (cant 
h a x a n t h i n ) LT7i-: + t>f > (phoeni 
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coianthin ) £gT7X**-y->?-> (astaianthin ) 

z&i&tzMm%&T*$>z cmi ibid o tits, %. 

t S DNA mH, f£*»&*lTlr*ttfrofcfc©T?*So * 
> Agrobacteripm 43 J: Alcal igenes © CrtZ > 7 
S J 88 SE ?!l M A f , E r w i p i a oredoTora © C r t Z £ 
* tl ? tl N 57% £.fctf 58 %©7><x>x^ x - £ 
5 ^ t> t d - §f L o tt n Agrobacteriam £ 
A I c a I i g e n e s Fh| © CrtZ © * * o - it s 7 5 7 g£ E 
n U K ;U -p „ 90%©7f7 :f >x>f t w - i l» 9 S ^ * t 

# 3fe £ (3c IB fi E r wj n i a © * u * J 4 K & j£ it £ £ 
E ^ £ c £ fc «k *) > *!§®l|©®:£$iic£- # d * > £ 

' C tl \Z c r tff £ flu x. £ £ , ^11 I (D || | |; A >^ + 

5 o L # -3 T * Agrobacter icm £ tz It Alcal igenes © 

CrtZ 0 1 I li , E r w i n i a © c r t B » c r t B N c i t K c r t Y 
*£? ( /8 - *of % <fc , C ft £ ic 

Agrobacteriam 3 It A 1 c a 1 i g e n e a © c r t W it ^ £ Jrj 
*.£*>© (Ji ><D£.&) lc«k*3#:i&Sti£ 

© T > : tl b O c f t it & =f-m*£ti±®m^C[)WL&%!l IZ 
Agrobacteriam t t: it Alcaligenes © c r t Z it & ^ £ 3| A 
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U J b« >3gf{b&3jtigfc? (crtY) 

(E?iJif307;;tif 1- 3 8 6 ) ttOTU 
m. E 81 * 3 - K * 5 DNA m it , !) ae^lftB** 3- 
KtSlfi? ( nil i * * ) 7*5o C © H 1! ft #1 fcU 
?g # flB jj Agrobacterium aarantiacns sp. nor. MK1 J; 
Ho-i>/Lfc ml it * 3 «k 4 H 

©EjfcSF*T®7 $ ;!EJlJtlt« *H^f K * 
3 - KtSSSEaifr&fcSDNA 8! 5 „ cj_U it & ? 
1 «7 ( £1 T C r t Y £ t> Pf ) ft ^ >J 3 h° > ( I y c o p e n e ) 
^ Sf i LtiS - ij u 9 > (/S -carotene ) £ £ * 
**Stt*ffLT^« (mi 1 EI # JK ) o # # £ fifc ID K 

E r w i n i a © 1i u * J 4 K^Efcitfe^^JB^aCiKJ:^ 
*»»«?©»£«&£ 'J ^ t 4 > £ ft £ * : i4«TM, 
3" *> $ „ E r w i n i a © c r t c r t B ■» c t t I it ¥ li > ^ 
BB^<Ott4'ftR:U3tr>^Mtl*^iL5 ( ^ 1 0 El £ 
«fctfiiufE©W0 91/13(n8-f&P3&?B#il) o LfctfoT* 
Agrobacterium © C r t Y (D § g li , Ertinia P ert it 
?|ia!)ftfea5<!)T> ± E © Erffini a © crt ig £ 
?#*£fc*»B*©tt&ttfc Agrobac te r iom © c r t Y 
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ft £ \ Agrobacterium © CrtY {i > 7 ; >/ | E Jlj 1/ ^ 
;U "C x E f ff i n i a uredovora © CrtY £ 4 4 . 3 % © 7 -f x > 

£©&1i$>M#"einI-T&£ (11 00. ST1 1 El # J& ) 

o 

•g & 5 A g r o b a c t e r i a m aurant iacas $p. no?. MK1 <fc 
& Alcal igenes sp. P C - 1 it „ TEOi^ttllWttS 

* ^ 1" o 

< Agrobacterium anrant iacus sp, hot, MKl > 

( i ) mm 

S • * S 3 : # > 0.9 jec ■ -x l. 2 n 

3 id : £ 0 

: *> 

M <D £ tit J& : U I 

m =? © t& m : * i 

7 A ft ft : ft ft 

(2 ) 

: #S£ffct£-eft2R£W1-<S. & © 3? 
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«H-*#«l:«»i#^-ru4tU ft fe * * * o 
A H- H? 7 f- > ^ m « S : fl?L*'t'iC>i:«ffii:4tt«, 
(3 ) £S#ttftft 
5g it £ 0 it 5r: : IB ft 

* i > s © m m : it ft 

fg © £ liS : lllt?§ft©#tifefefg 

: n ft 

: H ft 

^ - y^a^-lfgtt (i^'J/^lStt) : H ft 

/S - # 7> h ^ - H? g ft : ® ft 
£ W © i§ H : PH5 - 9 ^ S$ 10 ~ 4 0X! 

: # ^ ft 

«7K»ft : lift 

0 - F f X h : gg {fc 

« ft © R ft tt : 

Btt:D-/A/3-* % D--7>y-7.> D - # 5 ? h - 

* D - 7 * * h - X > aH> ^ ^ H s 3 It > ^'J3- 

y > N N-T -fe f" ;i>-D- ^ ;u n If 5 > 

ltft:L-77h*y-^x D - v y - b -iK <f / ;/ u 
- )l> „ L - 7 a y - X N D - V ;U h? h — Jl 
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n& : a mm 

fit ft : 9 x >mm, U > =f It £ , ;U □ > $ £ % #7 
•J > H #i % a " ? S & <» 7 i? h° > 1 ^ 

H ft : J is > . * y 5? > , / ^ 3 - X -1 - 

/ ^ 3 - x - e - y > s 

• It ft : * 7 f- > > L-TJK-^, DNA , * 4 > 

< Alcaligenes s p. PC-1> 

( 1 ) JBIB 
lOl'^U 1.4 in 

Slfrft : $> K) 
m% : M € * «9 
fffl A3 © £ fig : tt L 
IS ? © % fi£ : tt L 

ft £ £ W * -5 o 

: 111*1^-1:41 L, « fi * w * 0 
(3 ) 4if Wttf 
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m ^ © £ & : 

Jj * y - : I® & 

£ W © IS ffl : P H 5 ~ 9 \ &m 10 ~ 4 0 °C 

fc*S& : Utt 

0 - F t Z V : M it 

y h*nf K*^SJt5 :i*<8§SftT^5 (Yo 
koyama, A., I j n m i d a . H. , Miki, W. , 'Marine bacteri 
a produced astaianthin'. 10th international s y m p o s 
ium on carotenoids, abstract, C L 1 1 — 3 . 1993) o BU i$ 
Lfc&ffflHBi Agroba-cteriam anrantiacns sp. nov. MK 

1 £ fz It A 1 c a 1 i g e n e s s p . P C - 1 © c r t it ^ © ft © 

n^*^D-^i iTffl^ftlf, © * * n - £ 

* 5 o ^ 98 % ^ n A g. a u r a n t i a c n s s p . no 

v. MK l © c r t ff <h c r t Z # D N A SrfrfcT'n-T'i 
L T N A I c a I i gene a PC-l ©$fefe#DNA S> N 36 < 'W 
y'J^WXlJ-SDNA ir it £ L T % _c_r_JJL i crtZ ft £ ? 
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romon as S D - 4 0 2 £ 

S/kT*^ Cft^&JfefeflcDNA t, Ag, a u r an t i a c 

l± s p . nov. M K 1 © c r t ¥ t c r t Z £ £ tf D N A (ft K" £ ~f 

C©7D-7teC07g##fflM©&fe#DNA C^tS^' 
^'UM7')^XLfc, * g m K «fc « DN A- $1 ft % 
- © «k 0 fcflflHDNA gg (2 ) , (4 ) . (6 ) , (8 ) , 
( ID . ( 1 3 ) . ( 1 5 ) , ( 1 7 ) , ( 2 0 ) , ( 2 2 ) , 
(24) <fc (26) £'W7>J*WX1-*DHA SI £ S £ 

DN A § © % § 

±!2©£g£3t©7$ ; S E ?ij ^ 3 - Kt«iSG9!i$ 

: ©Ifcf ^JliS J; U t Agrobacterium a n r a a t i a 
c g s s P . nov, M K l £ ti it Alcal igeaes s p . P C - l © h 

— ^ DMA *«£tt«H**-ej||ftLfct>0£B^-c*' 
lBT?5^7'J-*ftBLs : o 7 ^ y 7 'J - ^ ^ S 

fe^if 07>f T-fiffl$nT^^s, fc 4 it tf « a 
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jj§ II > Zymomonas mobilis N Agrobacterium tumefaci en 





I£ S f 1 i If 


Saccharomvces cerevisiae} 


© 




«At8 : i 

»J X N J ^* 




£ S5! 




V— A/ >» C. V 


0 






• ^ \ 5^ V » 




® fiE 




U T P • 1 t* 


i CD T ^ ^ « 






S * o fkik® 


N ^ 7h >N JLB. TE\ J ^ /\ <G cK LT^c 




£ to £D 


# IS ^ 1 15 


# £± ± ^ Bfl ^ u r T f ? i 




5 £J ft 


O O H £ ^ 




HE 


$ tl T 


^ 3 *> O £ # 


* n ( *: £ 




' Ve c t o 


r s for c I o n i 


d g genes', Methods in Enzymo 


logy 


, 216, 


p. 4 6 9-6 31, 


1 9 9 2, Academic P r e s s % J: 


s '0 


t h e r b 


a c t e r i a 1 s y s 


tens', Methods in Enzymo logy 


, 20 


4, p. 3 0 5 - 6 3 6 , 1 9 9 1 


, Academic Press # M ) {c ^ 


C T 




ft I* £ o 






< *H 


■ > 







j£ i)< * 0 N * ft * J8 V* T ft A. <k ^ ( fz t A. ^ S a m b r o 
ok, J., Fritsch, E. F. , Maniatis, T . , 'Molecular 
cloning -A laboratory manual.' Cold Spring Harbor 
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Laboratory Press, 1989 £ JIB ) e 

? © & 9 ^ & $ o T ft *- tf J: & ( fc £ *. % 1^ 
© 'Molecular cloninj -A laboratory manual.' #M) 
x *:££(£> pUC ^ p B 1 u e s c r i p t X9<Dl.ic <D ~f u * 

T'Mo 56 &J3 * m it ^ 1 a c © ZfuZr-f-lgZ^tZ* 
MWm*??- pBlnescript II SK t fz it KS £ ffl ^ 
T * ^n^e-^-og^© 'J - K * X> - * £ # S 

# IrI „ Agrobacterinm aarant iacus s p . dot. M K 1 © 
c r tW N c r t Z ^ c r t Y g fe ? £ A 1 t a I i g e a e s s p. PC-1 
© c r tW % c r tZ ig ? £ # A L s CtLb©^^^^^:!!® 

t? in a $ * o 

W- § Saccha romyces cerevisiae ^©^^iffe^©^ 

* ft * E ^ T fif * tf J: V* ( £ *. x Uc iJj *§ - M & 

it £ T O i i PGK ^ GPD ^©^n^E-^-fe.fcrj: 

#*itfc?£C©7n*-* 

b £ ^ S. c e r e y i s i a e ©^?*- N £ > j r p ^ 

(&«&&#© ars mn^mmm^ttzmmm^Ji^ 
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n £ - ^ ? * - ) . Y E p m » m D N A <D^M 

fijft*»o*ftfflT^^ae-^^^-) N Yip % ( g 

« o tt « e-jft * » tt ^ » * ft # a a * ffl ^ ^ ^ - ) 
* > * - *u B*isft^«nc s/ u - x r mnikm® 

iz*b<D&&=?X¥l «**J£flU fc«fctf N Yamano, S. , 
Ishii, T. , Nakagaia, M. , Ikenaga, H. , Mi s a w a , N. , 
'Metabolic engineering for production of £-carot 
ene and lycopene in Saccharomyces c e r e v i s i a e ' . Si 
osci. Biotech. Biochem. , 58, P. 1 1 1 2 - 1 1 1 4, 1 9 9 4 # 

ffl. ) o 

< Zymomonas mobilis > 

x 9 J - -«/ 4 1 i I Zymomona s mob i 1 i s 's CD ft 3fc if 

K) 'il 0 Z. b. % ^ Zymomonas mobilis T? © ft 3fc it ? 

0 |g S ii > tz b x. \f Zymomona s mob i 1 i s ffi K ? ^ - p Z 
A2 2 * 1 T.ff K i * (*tt3£a* r Zymomona 
Lil©^?WlJ > B^gSl^^g, 63. p. 1016-1 
018, 1989 „ is «fc tf^ Mis a? a, N. , Yamano, S. , Ikenag 
a, H., * Production of >S - carotene i n Zymomonas mob 

1 I i s and Agrobacterium tnmefaciens by introdaction 
of the biosynthesis genes from E r w i p i a nredovora' 

Appl. Environ. Microbiol., 57, p. 1 8 4 7 - 1 8 4 9 , 1 9 9 
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1 #JKO o 

< Agrobacteriqm tnmefaciens > 

II % M M $H M Agrobacterium tnmefaciens ^ <D ft. % 

<fc!3{T?Ci^t^ Agrobacterium tumefaciens <D 
^^itfefOlliii, £: t z. li Agrobacterium tumef ac 
j e ns ffl ^ ? * - pBI 121& m ^ T fir ? C t i)< V § * (Mi 
sawa, N. , Yamano, S. , Ikenaga, H. , ' Production of 
£ -carotene in Zymomonas mobilis and Agrobacterium 
tnmefaciens by introduction of the biosynthesis g 
enes from E r t i n i a a r e d o t o r a ' . Appl. Environ. Mic 
robiol. , 57, p. 1841- 1 8 4 9, 1 9 9 1 £ ft ) 0 
W&tyK & y Y 4 ^ £ g 

>**S»it4<r h*nf ;^ K £ fifc » ft * S 1 

7 r A> * i/ )U \£ u y >n ( F P P ) ii 57 n f- y >r K £ 
£ < N -fe x * x > % I* 'Jf;K^ x f d - ;U N 

x ;u ^ y -r Ktt^jSL'Ti»«o-e, 1" ^ t © m ^ fei (i % 

g # ftfc ^ *r^ft^S^iLT PPP^f (if 
& -5 o — # n # 3fc & Iffl S E r f i n i a O * n f ; ^ K £ $ 
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it £ ? i¥ ^ FPP & g K £ I T N Agrobacterium a a r a n t 
i a c n s s p . n o y . U K 1 £ t: it A I c a 1 i g e n e s s p . P C - 1 © 
c i t ilS?il!lOSI> t tt t> % > 'Jaf>, 0 - -h 

z (mi o@#i) o % m # m » . ^iifc*intt<si 

8H L B £ ft £ x & $ Saccharomyces c e r e v i s 

i a e x x^y-;l/£gjfflg Zymomonas mobilis e Ifc ® ffi 
MtiSlW Agrobacterium t u m e f a c i e n s IC E r i i n i a CD c r t 
lfi?»*IALx.:n6fl)i4i*<, ¥ m £ £ (3 „ 0 

£ * > t?i:lBLT^i (Yamano, S. , Ishii, T. , N 
akagawa, M. , Ikenaga, H. , Misawa, N. , 'Metabolic e 
ngineering for production of ^-carotene and lyco 
pene in Saccharomyces cerevisiae *. Biosci. Bio tec 
h. Biochei., 58, P. 1 1 1 2 - 1 1 1 4, 1 9 9 4 > Misawa, N. , Ya 
m a n o , S . , Ikenaga, H . , 'Production of yS - carotene 
i n Zymomonas mobilis and Agrobacterium tumefac i ens 
by introduction of the biosynthesis genes from Er 
ff i n i a u r e d o y o r a '- . Appl. Environ. Microbiol., 5?, 
P. 1 8 4 7 - 1 8 4 9, 1 9 9 1 > * «fc & » * % BJ§ # & iz <k * ft # til 
IE # H ¥ 3 - 5 8 7 8 6 ^ & « ( # H ¥ 2 - 5 3 2 5 5 # BJ $ § ) : 
r*Df y ^ K © £ ic *r Jfl « d h a tg J ) o 

L :&< o T N Erwi ni a ^^©rtn^y^ K £ lit it & =f- 
S¥ £ #fg BJ§ DNA m (IIS<Il:l^ Agrobacterium anrant 



-31- 



WO 95/18220 



PCT/JP94/02220 



i a c n s sp. dot. M K 1 tz li. Alcal igenes s p . P C - 1 

# © # d ^ y ^ K & $ it £f ££B*&fr-&T|5l-© 
?»AI65^*<«titT^«-r^T0»4'Bjlc x 7 x * 
t6^tt«ttf*e*«o & T ic N ^fi^hanf-y^ k o 

£ - # n f- > $ ic $ & e r t i p i a i r e d o t o r a 0 c r t E ■» 
c f li s c r t U c r t Y it £ ^ M^A^f ilfe? 

* 5 # IB 9! DNA m (1 ) - (9 ) (Di»f ft^ljfill 
A IS ( & 32 fill (r fi Agrobacterinm aarantiacus s p . no?. 

MK1 Z fc ft Alcal igenes P C - 1 0 c r t W it fc ? ) 
■ B*oi4»i:$AL*as*.« :iKJ: t)^ M^M 

^ >^4i*tKi^Tt5o ±IBDNA H ( uj_W it 
£f ) ®!fcSi'^;l'0!Bift*c*i*^f'$-38£4&0ig£ 

Erwinia nredovora 0 c r t E •> c r t B ^ c r t U c r t Y it 

^-pAcicm a i tz •? 

y * * K p AC CAR 1 6 A titK J: & N Ag, auraat i acns 
sp. nov. UK 1 0 c r tW it & £ £ fr Bf Jfr & ± Bi M /< 4 
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* - p B I o e s c r i p t II SK- icftAlfczfyXi K p A K 9 1 

6 k £ * j§§ m J M 1 1 1 ic 9 A U ^n*£« 

II^IU # n ^ y ^ K ft * * a ffi L 

/To &&$ft/c£^©M%tt#>**f->f->T'£>t^ 

6 %lix**y>T&^fco £ £ ^ #>**U->f->© 

CiltN *II2 'J » h 3 ij o to 

E r w i n i a n r e d o v o r a © c r t E N c f t B » c r t I > c r t Y it 
mft*±mW<t ^ — pACYC184 KiAtfc 

7 5 X $ K pAtCARl 6 AcrtX^ £ <fc > Atcaligenes PC 
- 1 © crtW m mftZKMW*? ? - pBluescr 
ipt II SKI i:iALfc^7^ ; K pPCU-3 Op|y 7 

L v. m#*m#>. ii u f- J 4 h'fet^ilULfco $ Hi £ 

* y > T" & -o 0 S&OIOK * d f >T 

i "5 t o 

<7X^+t>f>> 4-4-h-t?7*1f>f->©£M> 
t' 7 * f > f > "a- fi£ IC ift M tl E r w i p i a u r e d o v o r a © zj_ 
J_E^ c i t B % crtU c r t Y x c t tZ it ^ <i; >7" FSIA 
^^ita?-e* 5 #&HJ!DNA «[ (10) ~ (18) © V^ftl 
1 Ig © DNA 0 ( & M tfjfc fi Ag r o ba c t e r i am a o r a n t i a c n 
s_sp. no 7. MK1 t tz it Atcaligenes PC-1 © c r t W jj 
£?) **ll«<Dl4*KiA't*ajt*Kii:J; 

*> ^ ft -ii i®HT7^^t>f^ * ft i» m m t 
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±IEDNA m ( crtW itfe^) 036Sl"<;l>0ISt|f<e>c:ft 
*St*»4ttO««*#Ott»*CJ:t) N 7 * * * t 

09 * # JS $ ft Ir* o 

Ertinia oredovora g> c r t E N c r t B N c r t U c r t Y >> c r t Z 

afc^fc^fcBtffc^BB^J-pACYClM A 
I rt: 7 v X S K PACCAR25 A c r t X N & <fc tf^ Ag. a a r a n t 
i a c n s s p. no v. MK1 <D crtW igfc?£#fr|8rK-£;*cfil 
B^^-pBluescript II S K - K. W A L ti 7 y x i K 
P A K 9 1 6 OS^7^ K£*BB JM10nc*ALx * ft 
^^tSStT^IU M # £ 31 tf> > *d^;^ K fe * * 

tiES L fc o JfifcBSftfcfe^©?^, 7X**l*->*><!: 
4 - ^h-t?7*1>->f-:/0JKfite^ * ft ft > igg&2 
U »/ h ;!/ in <o 1.7 n g n 1.5 a g T & o rt: 0 

Erwinia uredovora g> c i t E » c r t IU c r t U c r t Y > crtZ 
Bttfc^BB^^-pACYCm ic ^ A 
L rt: 7 5 X 5 K p A C C A R 2 5 A c r t X . J; ^ A I c a I i g e n e 
s PC-1 O crtW afi?£Sfr»f)t**Bg'< ^ ^ — p B 1 u 
escript II SK+ fc#AL£:75X = K p P C 1 7 - 3 ®M 
7 5 x S K * * B B 1 M 1 0 li: m A I ^ ZtitfeftMtZ* 
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* *l ^ ft > J§ g & 2 U 

$J 1 I[ T'i fco 

£ - ij u f- > £ $ IC & E r w i o i a oredovora O c r t E > 
crtlU crtU crtY g £ ? £ ^hg^AS^itfe 
^ T' * 5 # II BJl DNA m (1 ) ~ (9 ) O^ftuMI® 
DNA H ( ft 1! ftfc W: Agrobactcrium a a f a n t i a c n s s p . 
107. MK13 fz it Alcal i genes PC-1 <D c r t W g ? ) £ 

A VX*£?-C* * *»HBNA « (19) - (2?) 
O^fttfrl^ODNA (ft ( ft M ft ic {* Ag. a a r a n t i a c a s 
sp. nov. M K 1 1 it A I c a I i g e n e s PC-1 g> c r t Z g ? ) 

xi3 + *^>$4IH5Cii<7f ± IE D N A 

@ ( crtW gfc?fc J: c r 1 2 g ? ) 

Eftinia oredovora 0 c r t E N c r t B N c r t 1 x c r t Y g 
T^^t;»ffr^^liM'<^^-pACYC184 if A I 7* 

K pACCAR16 A crtK £ «k N Ag. a a t a n t i a c a s 
sp. hot. UK 1 <D crtW % c r tZ g & ^ £ £ t? »r #" * * 
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HB^^^-pBloeicript Ii SK- fcjfALfc??** 
K p A K 9 6 K (Dm??** K**»S JMlOllc^Ats * 

<h7x^n+1J->f->©JK*tt> * *l * *l N ^ g & 4 'J 

If a*«F£jJrfcTE©*D * ft £ ft T i,* * „ 
(i ) -Agfobacterinm aurant iacus s p . nov. MK1 
SKtf : FERM B P - 4 5 0 6 
g ifc 4£ £ B :¥JS)c5 £ 1 2 £ 2 0 B 
(ii) Escherichia c o 1 i JM101 (pAccrt-EIB, p A K 9 2 ) 
$ft #"t : PERM B P - 4 5 0 5 
gft*f£ B : 5 S$ 12J3 20B 

(iii ) Alcaligenes s p. P C - 1 
fFIE#-t : FERM BP- 4 7 6 0 

^ B : ¥g!l6f 71 2 7 H 
( i v ) Escherichia c o 1 i , £ : p P C 1 ? 
SffiS-t : PERM B P - 4 7 6 1 
gft *P n B : 6$ 7I27B 

1 
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cc-effl^&nfcatfoafi^fflgiiLHi&fcu nets 

£ ft T ^ if £ ti > ^ *P ft £ # & ( S a m b r o o k , J . , F r 
itscb, E. F. , M a n i a t i s , T. , 'Molecular cloning -A 

laboratory manual. ' Cold Spring Harbor Laboratory 

Press, 1 9 8 9 ) CS'i^T^S, 

mmm i ; %feftDNA © nag* 

SfefeftDNA (is 3i®i$ll> Agrobact 
erinm a n r a n t i a c n s sp. nov. MKK A 1 c a 1 i g e n e s s p . PC 
-K Rff Alt crononas SD-402 #c (Yokoyama, A., 1 z n m 
ida, H. , Hiki, W. , 'Marine bacteria produced astai 
antbin'. 10th International symposium on carotenoi 
ds, abstract, CL11-3, 1993) & H 8! L fc 0 C ft b © 

mnnm z 200^ >j u * h ^ (mi ) © ig n& cdifco tt 

8! " Marine Broth"- *K9!*Eft®$&fi: «fc D 91 Si L 
*:£«!) T-> 2 5°Cs £ir&3T4 BMjt*3'*fcHtt& 
H ® & x TES $ ® & (20 iM h U * x 10 mM E DTA> O.l 
M NaCU p H 8 ) T«9U 6 8 t T? 1 5 # F3 M & 9 L ft > 
5 mg/ml ij'/f-A (iltflil) £ 1 0 0 0 g / 
ml R N a s e A ( > / t ft 8! ) * £ t? I « (50 m M ^ ;b n - 
* > 25 mM h'JXs 10 mM EDTA, p H 8 ) K m ffi I a 3? 
tT-lBtRgf'^ + A^-blfc&N 250 # g /ml \z ft 
ZX^lZ-faf^i--^ K (Proteinase i N K - U > # 

- . ■? > ;W A S3 ) £ Jn *. x 37^-eio^F^^^^^^- 
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Kfco <* h 1= . fi ^ I g ^ 1 % I: 5 <t 9 i: f ^ n •> 
— )l C N-Lanroy 1 sarcos ine Na i/ ? v & ) £ JO* X. > 
£ < S & L & > nt-ei^ri'f > + a^< - Mfco § 
b . 7*y-^/^Dn*^Aj|||ffl*tt|g|fofclft % 
2ft*©x^y-^*i>.o< t> JD & tt *< 6 * 4fffflLT§ 

'J > X I & > 2 ml © T E © i£ ( 1 0 mil MJ 7, 1 
mM EDTA „ pH8 ) i:^iLt> & fe# DNA ^ gg $ <h I „ 
H fife fl] 2 : 3X5 F7^7'J-g)fcfeCiig)ft|| 

(1) 74 H>14^iBOftl! 
E r w i n i a oredovora © c r 1 2 * d f M K £ $ 

ite^S^f t2 ; K p CAR 16 (Misawa, N. , Na 

k a g a w a , M. , Kobayashi, K. , Yamano, S. , Ijawa, Y. , 

Nakamura, K. , Harashima K. , 'Elucidation of the E 

rvinia u r e d o v o r a carotenoid biosynthetic pathway b 
y functional analysis of gene products expressed i 
n Escherichia c o I i ' , J. Bacteriol., 172, p. 6 7 0 4 — 
6712, 1 9 9 0 > J: #IMJ3 #£l;:«fc£#ifrtij|i#Ig¥ 
3-58786 -*t£fg ( 4# M ¥ 2 - 5 3 2 5 5 ^ BJ3 Sffl S ) : r*nf 
J << K © £ Sfc * JB D N A H J ) h Bs t E I I ( 1 2 3 5 )- 
Eco 5 2 1 ( 4 9 2 6 ) Sr * Sfc H rt: & > 7 4 U>i4i:|I 
ft c r t E t crtB jg fe ? S # ft 2.3 *o*ig*f (kb) 
© A s p 7 1 8 ( Kpnl ) - Eco Rl fc'Lfco * L 

T N C © Br £ * B V ^ * * ~ PACYC184 © EcoRV $ 
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ffifcffALs SWit^^; K (pACCRT-EB ) 
fz o C© pACCRT-EB£ ft 5 -kmm its n 7 

A7i^3-;Htt C C m r ) b x > £ £ g 

~t Z> (Linden, H. , Misata, N. , Chamovitz, D. , Pecke 
r, I., Hirscbberg, J., Sandmann, G. , 'Functional c 
(implementation in Escherichia coli of different p h 
ytoene desaturase genes and analysis of accumulate 
d carotenes'. Z. N a t u r f o r s c h. , 46c, 1 0 4 5 - 1 0 5 1, 1 9 
91) o 

(2) *) * V y mtk±Mm<D ft® 
Er. nredoTora <D c r t Z El ft 0 ii n J 4 K&J&itfc 
?^Ht^77; K p C A R 1 6 h B s t E 1 I ( 1 2 3 5 ) — 

Sna Bl ( 3 4 9 7 ) »r JS" * Ifc ^ fc fc> 'Jatf>S4K^S 
tt till s crtK c r t B it ? £ £ fc 3.75 kb A s p 7 1 8 ( 
Kpnl ) - Eco Rl Bifrfcfil&rtflfco ^ LT, C © i ft 
^^11^^ - p A C Y C 1 8 4 <P Eco RYgg&|;:jfAL> g 
J!lit5^7^ K (pACCRT-EIB) * 0 CO pACCR 
T-EIB taill^ Cm r 'Jnb^^il 

~fr & (Cunningham Jr, F. I;', Chamovitz, D., Misawa, 

N. , Gatt, E. , Hirschberf, J., 'Cloning and functi 
onal expression i n Escherichia coli of a cyanobact 
erial gene for lycopene cyclase, the enzyme that c 
atalyzes the biosynthesis of >S -carotene*. FEBS 
Lett.. 328, 130-138, 1993 ) o 
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(3) e - *u?>m£.*mw(Dftwi 

E r . oredoTora © c r t Z SX ft © fj o f- J -f K £ j£ it 
fil^f t 5 ^7 X ; K p C A R 1 6 ©#JIE8£^ B s t E I I ft 
ft n Klenow fragment ^-f^ r ~'>3>^l£;<£-fT , 5 
C <!: «k *j % c r t X jtfe?^71/-Ai/7 M: <t I) ^ g 
* ft % /S - *o^i4l;^|f4 cj_LE N c_r_t_Y N crtU 
LUJ &ti 6. 0 kb A s p 7 1 8 ( KpjJ ) - J_c_oRl 

pACYCl 8 4 ©_E_c_oRV^ffilC#A L> § W i t 5 ^ 7 X ; 
K (pACCAR16AcrtXi*ig) £ 0 C: © pACCARl 6 A 
eMl*#+**»Btt % C m r *^L > 0 - fj o f- > * £ 

S li n ^ ?§ ii (I*) 4fcr4K-u>jy---7>/N-fAtt 

gj|_g|_3 : J X 5 P7^f y 7 'J - OfjESife J; tffife^§ 
t-^^liS©g?# 

Agrobacterium aorantiacus s p. nov. M K 1 © & # 
DNA 25 ic *f I T N lii? hOjfilRi^ 8 a n 3 A ! 

£ ffl ^ 3 TIC x 157)F^O + a^ - h Lfc|, 68 °C. 10 
£ffl©&9-v?«HS**$cffi$'*fc 0 c © £ # 40 k 
b ft t& K £ < © _S_ai3A! fi&&£fl¥»rJt:&<»g,ftfc 0 C © 

— « * ffl ^ T n 3 X 5 K * 9 * - p J B 8 ( 7 > t? •> >J > 
I* ft (Ap r ) ) £_B_ajiHI Ilti7;H'j7^7 7 ^ 

- -tf © 9 L fc *> © x 43«ktfx p J B 8 S Sai l / BamH! 
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m ft '& £ 7 - a ( 'b S ^ # o Br Jt ) i^Afr 6iC Lfc 
t> © * in? * nx:*— tfe^-fy-s/avRi&fcffofco 

JC X !) in vitro /< y - 5? > ^ * > n X S K 5 -< 

C©7 7-'?fi?£*IIS ( Escherichia c o 1 i ) DH1 
(ATCC33849 ) > *5 J: x III 2 TftS L ^ 3 I© ^ 
5 X > K®£**W*3*»» D H 1 IC * ft 3 fc & > 

L. IStfll^^tLB (1 % h 'J 7° h > . 0.5 K f - 
X * x * * „ 1 % NaCI ) i:n-f orU 3 7 °C £ 

fi M & t? > -l^bSSISISlLfc, 

^-OlSm> tztz<D*mW ("*-5>afe) *3 <k Of pACCR 
T-EB ^ft57^ (^-^fe) ^ 

- - Kl ± x * U - - > / L fc # x feH#«fl:Lfct>©W: 
£ 6 ft tt o fc „ - 2f ^ pACCRT-EIB *f t« U ^ t°> 
M£*H® (9 tl^fi) £ P AC CAR 16 AcrtX£W 

t s e - * u * >m&*mm ( * ft ) ^tiu^x 

= P5^7 'J"? lis £ * x ^[ ^ 3 O - - |C 1 ^ © flj 
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£M-*-**BBJBHI£S!l*0H £A/£ti % p J 8 8 f:4l k 
b hu mo_S_aji3AI 8^^*lrfr*«#A*tlfc^7X5 K 
Z-SAj-Z^iZo K is x i£ O JzBR & * & pACCRT-EB 

Agrobacterinm anrant iacns sp. no?. MKl^fe 
ft D N A h 7 4 ;l/£j£it£?i?£|§§l;7 n - i 

> ^ t 3 i6 K fcU 4> « < £ 7 4 U^5|0*o 

Jl fife flj 4 :afefejK£ift«fc?eS*Sfr8rfr<P«/]Wb 
pACCRT-EIB *f t5 'J 3 t?>i4^i|JBl ( 9 t 
<k p AC CAR 16 A c r tX£ # ?" £ 0 - * of > 
g£*S§® DH1 (Jtfe) tt±Hfc3X5 F5^5 

o--^I^■e^ * © 7 7 x $ FHKfLfci;^ 1 
tt?:II^Tt^T % p J B 8 C33 kb frb 47 kb © Sao 
3A1 Sf7}7}»lfri<JfAn^57$ K # £ * *i T V* 

1= li > p J B 8 3. 9 k b © S a a 3AI S£#jB»rtf-j&<SPA 
S ft 7* 7 X $ K C P A K 9 £SftT^fco C 

ft i* * ^111:111, St A ®t )t © in Tito 7 1/ - > 

3>cJ:*)^ift$nfct)O-e*«iti6nfc 0 p a k 9 
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£t & © 8? #r J- i* x C © p A K 9 ^tf^i L 

pAK9fr b MM l fz 3. 9 k b ©_E_c_oRl J? A »f it £ * J§ ® 
^^^-pBluescript II SK + 6D EcoRI SB ft IC # A I T x 

x $ K C p A K 9 1 43£tfpAK92 t # £ ) £ rt: 0 C © ? £ 
7 7 X ~ K p A K 9 2 ©«II6B*i(IIia*»12BlC^-ro pA 
K 9 2 ^'j3k'>14^IIi:|ALfci:5, 7 X * * 

> > e& a mm m 6 ) laogfeno;-!^ 

*<x p A K 9 1 ^'Jnt^ii^lIClAlT^ «f ^ttt 

> K p A K 9 2 iz £ if Z> fe -* £ JSit e T 8 © # ISJ ft tt K * * 
- © 1 a c 7n*-*-©ffl£±Elt;T*5£%*.&*ifco 
-o^l-pAUl © P s t I ^ii: i *) ff itl'S 2. 7 kb© Ps t 
1 Brfrix p A K 9 2 frb BamHI # ff «k 0 % t> ft 5 2. 9 
kb© Bam HI ff Jt £ «fe Sai l £ft?l;:<J;>)*#$>ft*2.3' 
k b t 1 . 6 k b £ © Sal I Brfr££> ^ft^ft^^-pBlu 
escript II SK-lc^u-->^L^ 0 Cftjc£«9f|£ft 
ti Zf 7 X i K © 9 £ x p A K 9 4 ^ p A K 9 6 x p A K 9 8 x p A K 9 1 0 ^ 
p A K 9 3 x p A K 9 5 tZttVizzr?** K©*HJI®B£fSt*6EI£ 
mi 2B3(c^to 7 5 X S K p A K 9 4 x p A K 9 6 x p A K 9 8 x 
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pAI910ttfe*£fifcatfe : ?i¥®2f|R|tej&<'** * - © Uc 7° 
Dt-^-CfiitBUTi^ p A E 9 3 > p A K 9 5 T fi iS* 
T" tb 5 o 

2.9 k b g> fl a m H I K p A K 9 6 £ 'J 

3t'>i4^§Ii:|ALT{i> 3.9 kb Eco Rl &j ft % 

& ( M Jl 0J 6 ) > C02.S kb Bam HIffi © DNA @2 ?ij © ft 
Jg * fT -d o 

DNA IS ?ij © ft £ 2. 9 kb® B a m H I Vr Jt o ^ T IE $ 

0^**St5?o->CoV»Tff9fc 0 £ ft: ft 

I!(i41©y775 K p A K 9 6 > p A K 9 8 > p A K 9 3 s pAK9S 

iz -d ^ r kit q ^ mx- n -o ti o ttifn<o^575 k > 

lfl n s * All I t _X_b_a l 7x^-/1,/ 
* n n * ;U A & ffi £ ft lr> > i^y-^ftf&tcj^DNA £ 
HJlRL^o ^tl^tlODNA £100 # I ti> I z 0 I I I '< y 7 7 
- (50 mM Tris-HCl, 100 mM N a C 1 , 5 mM MgC12, 10 mM 
2- /)V1]'?Yx.9J-)\,, P H8.0 ) & J$? L , 1 8 0 a 

1 55- CT £ ic 1 fl n 1 H^'J >nt^ 2 If > 7* ;b # -f 
o x 2 0// 1 © U B /< y 7 7 - (40 mM N a - A c e t a t e , 100 ■ H 
NaC I, 2 mM ZnC 1 2, 10 X tf U -fe o - A, , p H 4 . 5 ) © A 
tz * ± ©f-a-7*(c^tfc 0 *>7*y>/|^7ft N 
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mz.xwvn$!Ei&uiizo £ & > r # o - * ^ ji 

S ^ & ®J ic J: »5 . 1 o 0 7 y X $ K K o V> T 5 I'O * ti 
etlfcfcOfiK^JIttSMA ®t Jt £ 0 iR I rt: 0 0JRL^ 
D N A ttKlenow fragment lcJ:»)*A8*¥7IMtLN 1 6 T ^ 

-ft?'fy-v'3>RjE;Uw«> * m s j m 1 o 9 ^ us 

- ?II13I07*J§V*T.1 **DIA *H1511n 7 zf ? J K /< 
f * •> X x A ( **) ©8*^7^7-*^ ^<>-^i 

^ x >t-*i^TDH4 S2 ?"J £ & 5£ L rt: 0 

*®*g£fc&ftfc28864B£*t (bp) ^ 6 S D H A E fll 
^15-90 ( E & ^ 4 ) lc^1- 0 §§ & n K > © ff ic 

. 7 i/-Aotfe*©ASlU 3lott > h 7 -f ;u £ j£ it 
=? c r tf % nt2x c r t Y © # £ ;&< ? & £ ft rt: & H C H SI 
00 8) i:^^^'^S*3 - 3cc^-^ 
>'J-f-f >r7l/-A (^5~9E*A.^bJ ( E 7>J 
#-^4©igg&fi229 ~ 8 6 4 ) x C ^ 6 D ( g g 8 6 4 

- 1 3 4 9 ) x E fr&P ( g 1 3 4 9 - 2 5 0 6 ) ) i)< & V* tz. 
£ ft £ 0 Z <D o *> k S> B x E fr&F ©2o©*-7*:/ 

GTG T * t> x C 6 D Ko^TttATG t Jfe 5 0 
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P A K 9 2 £ fc U p A K 9 6 *'j3tr>«4*»»IM101 9 
A I *> © (^Cli (pACCRT-EIB. p A K 9 2 ££<ipAK96)) 
(gfe^HTUS) x * x p A K 9 4 3 rt: li p A X 9 6 K 
(1120) Z 0 - ij n > 14^11 JM101 i:|AL 
fc t> © (^11 (pACCARl 6 A c r tX, p A E 9 4 £ fz it p A K 9 6 K 
)) (ttfe*MLTl»*S)*150 // g / a I O 7 > t •> 
•J > (*P> ifeHII) Ml 0g/ml©?D7A7i 
(Cm. = * £ fc 2YT Jg ft (1.6 % h »J 7* 

h>> 1 0.5 % NaCI ) 4 »J » 

600 ml (DTtbytz&vmtiiltio C ft £ ^ *j| & > 40 
0 ml <D9uu*i\,k/*9J-A, (9 / 1 ) T 2 0 Jfi 
tiils MW £ H t tz o £ £ K N :*l*/M© ^ o n* A 
A / y * y - Jl (9 / 1 ) » j» & % *frf$k$l<DftJiSL 
m S/ y * y ;l> T L C ^U-hfcJS^T* ?on*;l,A/> 

* y - ;U (15/1 ) Tl«t5:il:i^ SI^ot 
^77^ - (TLC ) Zft 1 tZo 7C®^fefe^(i, d © 
TLC K «k 5 » Rf fit 0. 72 > 0. 82 N 0. 9 1 © 3 * # v h 
iz 9i ft fz o ^fefe^^^©50%{c*a^-T5s g^S^ 

Rf 0.72 ©fe^i. ft I- 1 ^ R 1 0.82 ©fe^^ N TLC 

7* 1/ - h 6 x ;&> £ <h !9 & . ^iO^Dn**i/^ 

(9 / 1 ) *fcli^;-;H:ggL^ t7 7 f 

* X LH-2 0 *vA^nvh^57^- (15 X 300 mm ) 



WO 95/18220 



PCT/JP94/02220 



\z > 9 u u * )V J* / t 9 J - )V (9 / 1 ) t li ^ 

mg ( : Rf 0. 72 ) > 2 mg ( : Rf 0. 82 ) ft 0 
Ri 0.72 0fe*tt^ & # - Rl & * ^ * 1H-NMR. 
FD-MS X ^ ? Ml/ ( ■ / e 596 ) CilJ;^ 7X* + 

x * y - )l 5 : 5 : 2 K $ B I . CD X K * h A, £ M % 
I fz t Z 6, 3 S, 3'S c4ft«^KH<Hofc 
fc*!^ #®g£7X**1f->*:' (astaianthin N $ 

1 1 ®%m) ilil^Lfco 2 Rf 0.82 © St 
f* > ^^-Rjmx^^ h ;K lH-NMRs FD-MS X K 5 h )l> 
( n / e 580 ) ©£*£*)> 7x~n+U->f-> (phoe 
nicoxanthin > $ it £ IS 1 1 £ flf £ £ ft o 

Jl 0.91 <D&nteii>?*D->l-y-?&it: 
(HJfe0J 7 (2) ) o 

(1) 4-^h-tf7 + f->f">©H^ 

o t tt fc> £ n E r. u r edo vo r a © 4*of ; >f K & $ 
it H £ W "5" -5 y X S K p C A R 2 5 (Misawa, N. , Na 
kagawa, H. , Kobayashi. K. , Yamano. S. , Izawa, Y. , 

Nakamora, I. , Harasbiaa K . , 'Elucidation of the E 

ryinia nredovora c a r o t e n o i d biosynthet ic pathway b 
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j functional analysis of gene products eipressed i 
n Escherichia col i * . J. Bacteriol., 172, p. 6 7 0 4 — 
6 7 1 2 , 1 9 9 0 . fccfctf. #f£ W m h \z J; £ # & tB fig # fii ¥ 
3 - 5 8 7 H6 ^f&fg (#JS¥ 2 - 5 3 2 5 5 ^BJIfflW) : T*n^ 
H K© £JS& IC DMA iJ ) ©$ll|ggi3t_B_s_t_EII ffi 

it. Klenow fragment & g N ^-fy-S/al/RjfifcfrS 
CI i ic ck *9 ^ c r 1 1 ifif 4 7 1/-AJ/: h lc J; «9 & g 
Jni, ^7*t>f c_r_t_K c_r_t_K LL 

J_U c r t B , c r t Z £ £ ? * # 6. 5 kb A s p 7 1 8 ( Kpn 1 ) 
- Eco Rl BfJtfeflJOrtfLfco t IT, C © ®r £ * H 
M ^ * * - p A C Y C 1 8 4 © Eco RVgKftfcjipAL, g ft <fc f 
* 7 5 X 5 K ( pACCAR25 A c r t X <!: # £ ) £ 0 

p .A K 9 1 0 £ li p A K 9 1 6 (11 2 H ) £d©-t?7*1i-;'5 L 
> g £ * II g J M 1 0 1 I tz i> <D (^gg (pACCAR25 

AcrtX, pAK910£ fz tt p A K 9 1 6 ) ) (tfitlLT^i) 
*• 150 #g /ml©Ap£30 g / m 1 © C m £ # & 2 Y T ig 
UK 2 U v h ;U -C s 3 7t % 1 8 B$ p H ^ # L ^ o « S & g 
® I rt: S # £ . 300 ml OT-ir h >l:J: ^itfJLfco C 
ft £ ^ IS & n 200 ml © ? □ n * A / ^ * y - ;u (9 

/ 1 ) t 2 a a tB t n mm&m i tz 0 $ & & N :n*/h 

g©*oo*;l/A/y*y-;l, (9 / 1 ) » £ ft x ^ 
uu*s)\,j*/t9J-A, (15/1 ) TfgHtSCiKJ: 

*) , ^S^D^h^77^ - (TLC ) £ ft ^ o 7G © £ 
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fefelli, C CD T L C £ <fc D s R f 46 0.54 (46%) > 0. 
7 2 ( 5 3 %) v 0.91 (1 % ) © 3 * * * H \z ft tl tz Q 
Rf 0.54 © fe 31 £ x TLC f V - h & ^ i 5i> 'h 

»0?nu*;t>A/^*y-;u (9 / 1 ) tfeii^^y 

7 - (15 X 300 mm) \Z ft ^ ?aa*)l2*/t?J 
- )l (9 / 1 ) tfctt^y-^tgliSHltSCii: 
ids S-* 1.5 mg Ifco 

SttRfcfcttSa&fl-RTSX'** h FD-MS *H 
Y )l ( i / e 5 8 2 ) > £ ,£ > i/ U ii ? Jl> T L C 0 & id K 
(*on*;UA/>*y-;l> (15/1 ) TJg§§) # x 4 
-^rhHfT + ifl/f^OSllRfii ( Agrobacterium aura 
nt i acus ip, nov. UK 1 <fc «9 fit Si > # fl ¥ 5 - 7 0 3 3 5 ) £ 

( 4-ke toieaxanthiiu- ft j£S li SI 1 1011) i@SL 
o * x R f 0.72 <k R i 0.91 © fe * li * ft * 

ftx immm 6 ) x 

(#1*601 7 (2) ) T * * o 

(2)*>^^^>^>©f^l^ 

p A K 9 1 0 ^ rt: 14 p A K 9 1 6 £ £ - # n f- > M £ * M S J M 1 0 1 
IC 9 A L iz & © (^11 (pACCARH A crtXx pAK9103fc 
ttpAK916) ) (gfe^ILT^i) £ 150 jug /ml© 
A p £ 3 0 n% /ml©Ci£#t?2YT£*&2 U » h ^ Tf x 37 
"Cx 1 8 ft IB * * L fc o ltg&frt>3IMLrt:®#£x 300 

- 4 9 - 
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ml ©7-th>lcJ:t)}ftaiLfco £ £ ^ It & > 200 ■ I 
®>?uu*)\,2*/t?j-)\, (9 / \ ) T 2 [H fft ffl I N 
II & H I fc o $ & % :ft*/Mo^ nnt^A/y 

9 J - k (9 / 1 ) » # & s y^^ttfiO^KfflS/y 

# y ;H L C ^1/ - h*I^T, ?on*;UA/p<*y- 
A> (5 0/1 ) TggStSC ti:J:^ ^S^dvH^t 
7 A - ( TIC ) * ff o 0 ^fefe^^#£D94%(cffi^t 
5ifeI^feS^> TLC 7 1/ - £ £ £ o 0 $ 

& ic /j\ * © ? d o * A/ a / y * y - a, (9 / 1 ) t tz & 

*nn*;l/A/y*y-;l, (1/1 ) » £ tfc % -fe 7 7 x 

* X LH-2 0 #5A?ovh^57^- (15 X 300 mm ) 
IC if N ? n d * ;l/ A / y * y - jl (9 /i ) £ ^ 
oD^;i/A/^^y-;p (l/i ) T?gHjtFtB**C£fc 
<k 19 v do £ 3 m g ft o 

* ft » C * «■ £ & - rT £ X ^ * 1H-NMK FD-M 

S * ^ 9 h Jl ( n / e 5 6 4 ) ^ J; > •> V ij *f jl T L C 
O^l/g (*on*;l,A/y*y-;U (50/1 ) © M 
^ T* Rf 0 . 5 3 ) # *>^**>f'>©*l|lfl ( B A S F ft S2) 

nthaianthin ^ H i§ 55 £ 1 1 £ IrI ^ L <, * 

tSfell^ % 0- - Rl 'ft X K 9 h Jl ^ v> U -h *f Jill c © 
I& g (^Dn*^A/y^y-A/ (50/1 ) T-©®&§ 
T' If 0. 7 8) ^ <fc > y HI 6 /£ CI 8 ( 3. 9 X 3 0 

- 5 0 - 
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0 mm) ( * - * - XifcS!) £ m l»* tz HPIC© & M K (T 
■b l»-MJ*/^y-*/l^n/W-* (90/6 / 
4 ) T- 1 . 0 ■ I / ■ 1 1 ©jSg-e©®§g-eRT16#) <k 13 x * 

* y > (echinenone. M jg £ (i % 1 1 E # Ji ) T^i 

* x. £ ft 0 

(3) T * # > * > O m%. 
p A K 9 6 N K (SI 2B) * iS - #n^>|££*j|§MJM10 

1 fc$ALfct>© (*8§® (p A C C AR 1 6 A c r t X > p A K 9 6NK) ) 
(Jtfe*MLTV**)*158 g / ml © Ap <h 30 n 

/ ml© Cm£ # t? 2YT ig itfc 2 U y hbf, Hts lSttRSg 
g I rt: o Jgg&frS>H®lfcS#£^ 300 ml © 7 -i? h 
>icJ;«9ffltBL^o C ft £ g fit & N 200 ml © ? n o * 

;uA./^^y-;u (9 / 1 ) *r 2 isi tts t ^ ^ H S£ @ L 

(9 / 1 ) fc*ffH*-y/U*tt«©»WJ8$/y*y;ML 
C 7°l/-h£JB^T> 9uu*A,l»/*9J-fr (9 / 

(TLC ) to Stfefe*£#©nKl;:ffi3**«t> 

* If* ft * * % TLC :/ U - hfr & > * S £ o fc «, $ £> /Jn 

Ko'^DD^u/^y-A (9 / 1 ) * ii ^ ; 

7 -f - (15 X 300 mm ) KfrfcK 9nn*)lA//*y 

- ji> (9 / 1 ) t tz it / * ; - Jix* mm® no- z z t \z 

<fc «9 > ft & £ 3 mg# rt: o 
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S X ^ 9 h ;b ( m / e 5 6 8 ) % £ J; > U ii ff ;H L C 
®&Wl&. (tuu*)\<J*//*J-)l (9 / l ) -p © g 
H Tf R f 0 .. 5 9 ) # s -fc?7 + f->^>©^ip,a ( B A S F ft SSI ) 

T? x s; x ;l/ x - f- ;}/ : : x ^ y - ;b 

5 : 5 : 2 I^IU CDT.^^ h^HLfci 
31, 3 ' I ®3:#flia*i*Ci*<t)*>ofeysj6 N # % g 
£-fef7*-fr>*> ( xeaian thi n, iS^'till 1 H # JR ) 

'J * y ;H L C ®®n& i9uu*A,j*/*9j-b (9 
/ 1 ) "C © 5 T" R f 0. 8 0 ) > & <fc N y '< /< y t H R 6 fi 
C 1 8 ( 3.9 X 3 0 0 mm)- ( £ * - ? - X ft $g ) * ffi O fc HP 
LC©^Sig ( 7 * 

- ^ (90/6 / 4 ) T 1 . 0 ■ I / ■ i n ©i£g-?©jg§§-eRT 
19#) 9 V 7 h * * > * > ( /? -cr yptoxanthin^ 

(4) fi - ft u 3- y vnfe 

P A K 9 8 *'J n e>i4^fif inn fc$A L t> © 
(*mm (pACCRT-EIK p A K 9 8 ) ) (tfe^ILt^S) 
£ 150 # g / m I © A p £ 3 0 #r/ml©Cm£#t;2YTig 
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m I ft Z , 3 0 0 ml © 7 * h^fc^JfiHiLfco C 
ft £ M & . 200 ml©^ + f->ir«k?)2lHltttiiL^o $ 

/57V- (15 X 300 mm ) K 3> # > ^*if->/ftfti 
;U (50/1 ) V S §§ & Iti t 5 d <fc J: 9 . tt££3 mg 

ft *o 

SftftllfcttS^fl-RFfc*'** FD-MS 
h )\> ( D / e 5 3 6 ) > «k x y y ^ H R 6 jf C 1 8 
( 3 . 9 X 3 0 0 mi) ( 9 * - 9 - X tt U ) fcftUfcHPLC© 
^ S6 g ( 7 -fe ^MJ^/^y-A/l-^o/W-^ 
(90/6 /4 ) "t? 1.0ml /min ©i£&T-RT62£) # * 

# - n * > ®mm& (t-;n«7>7S> 2/ / v tt «) 

it^T-itfcfc)&> 0 - * u 9 y C iS -car 

otene > 1?g it 5£ I* SI 1 1 Bl # ft ) £ E £ I o 
ft % ft 8 :*»>h74;U&ftatfc3P8©ft£ 
(1) ^hgftAH^itfc^OH^ 
p A K S ( H i« 4 ) SfctepAm i: ^ t H 5 U kb© ®r 

ttt^T©ae?«^ £ffi'J©2. 9 kb Bam HlffirJt C p A K 9 6 > 
1120) ©*ea**iTV*SC£*«j*ifc0!6©ttftJ: 
& ft * T? * * 0 L fc # o T . £ ft © 1. 0 k b ft it « & ft 
V* o C CD p A K 9 6 ©2.9 k b_B_a_m H I Sf W © + JC 1 * jjf © Jj_c_o 
1 ft ft £ 1 * if © Kpn I ft ffi 0 C© Nco I t 

Kpnl ftffi©H©1.4 kkftfi- ( p A K 9 6 N K ) \Z it tK ft S ft 
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z t t<mmn i ( 3 ) ©eii^^So 2 * , 2.9 k b 

JLiiHIBrfr©4Ui 1 frjfr#£ l±K© JLloI £ J_p_nl $ 
& © ffl K & * Sa 1 I 3$ ft £ £> lc (2 C © S a 1 I SB ffi «£ *) _ 
©J_i_ncII M&frbl. 9 kbJTajiHIBr it © £ ffl © Br # £ Bfc 
^/tBrtf- ( pAK910fc <fc pAK916) C T 0- ij u 

( H JS 0J 7 ( 

2) ) . p A K 9 6 ©2.9 k b B a m H I Br it 5 _H_n c I 1 g& ft J; 
19 $ g K _ © _N_c_o I $fi^&£fl9©Brtf*Rfc^*:Brtffc 
It 0 - * n * > $> * y * V > * *s % & fct Z 
gtt««*Lt^fco pAK916©0.9 kb© Bam Hl 

- Hin di Br ^ T N ^©^O^E^tciyjjJf^g^- 

5 _Jji II «&*&_:«©»?#*£ ^Tt> x ±E© _B_aj 
HI -JLUcIl Br M* (PAK916) t ft m © S ft # # ffi I fc 0 
L *< •» T x pAI916rt©0. 74 kb © Bg 1 I I - jj j n c I I 
Br © * 0 - * Lt* + f >f > 

L % C©jfife^©4i{c±sB© Nc o I $ & 

©itfc^tC^^tSs B§ & 3 K > © £ tff U # 7 - A 8 
^ItittS • U-x^f > • 7 U - 

A*tHit5:i*<-elfc©T' N C ft £ crtW jg 
❖ € L o C © _cj_t_L.it fe*®£SEai£3- K S ft S 
POTWIEJJttllB (E^Jff l) (c 
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7£ £ ft X ^ 5 o 

d CD Agrobacterinm aurantiacus sp, dot. MK1 ® c r t 
W_it ^ g $1 (CrtW) li> £ - (/8-ionooe 
ring) (DAtfLOtfUy (methylene ) g £ 4" h (keto) 

1 o £ - # d * > C ^ -carotene ) *SSHti 
(echinenone) *gT* (cant 
h a i a n t h i n ) ££j£-TSB$flt&&T'&S ( H ifc 7 ( 2 
) > $1 , c_r_t_Wit fefiftlis 3 - 

t K n * 5/ - 0 - 4 * y > 3S C 3-hydroiy-jS - ionone r i 
ng) <DAtiL<D^tU> (methylene ) f (keto) 

1 o # x 7 * -fr > f" > ( a e a i a n t h i n ) ^Ili LT4 
- <r K7*t>f> ( 4-ketozeaxanthin) & g T 7 X 
(astax-anthin ) ^^Ut ? liStt?* 
S ( H 7 ( 1) . II 1011) o tt * . C O £ 9 

ttgSSt^ftS^M^f Kfc«ktfcft£=i- Ft 
S D N A ffitt x ft#»6tlT^ft^9fci©-p*!>^ d © 

flca^T^U'Ci^^'J^^f K '* it DNA g 

y > ^ * 3 - t K n * 5/ - 0 - U;>|i:R&fs 
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(2) *wtmmAmm&&?<D mm 

P A K 9 6 © 2. 9 kb Jj_m HI jf © £Hc 1 gr © Sai l $ fir 
*<l?2t5o £ © Sai l a5fiT2. 3 kb Bam Hl ft Ji* * 2 
o©HfK-K«IllrLT«»)ffl1-i % 2o0|}fr ( p A K 9 1 0 £ 
P A K 9 8 ) *SIIAgf Sitti< /i, t L| 

9o * tt t> *> x £ H © »r (pAK9lO) I: li F S f A i 
*Sttlfr#£Ltt< ( H H m 7 ( 2 ) ) , £ fflij o »r 
( P A K 9 8 ) Ktt'j3t>Ifl:if jgftL^#4L4^ 
( & tt HI 7 ( 4) ) o - # n ± E © Sal I M ft * # tr l . 
4 k b © _N_c_o I - K p a I Br C p A E 9 6 N K ) i 0 - ij u * > 

T N -tfT**>f->*^rtf*J;^lctt-B ( H il 0J 7 

(3) ) o I j&< o T > C © 1 . 4 kb© Nco I - K p n I #f 
© # K £ - *nf >$SIi LHf74t >f 

£?^'#ILs £©*£?©# fc±E© Sal I S ffi # # 
SLT^Si*i&ft«, ttSE*Jftj£©|g£ N C © it 
£ ? i: ffi S t 5 > B§ & n k > © it M in y ** v - a & £ gg 

^Hit 5 d £ ri< T? # £ © T" , C ft £ crtZ ig fe? i^g 
I o C © crtZ it £ ? © ig S E ?ij £ n - K 3 ft * * 'J 
^^f FOT Wl E*J I* H 2 E! (EWf t 2) ic ^ $ 

C © A g r o b a c t e r i b m aorantiacas »p. no?. MK1© crt 
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Z_it (CrtZ) fcU £ - << * J > m ( £ - i o n o n e 

ring) <DZ&®mm\zl'D<D7kmm%tflMtZWm7£& 
^ H*#»4flflloi«, jS - £ n f- > 
C >S -carotene ) i L T £ - * »J 7 h * * > f- > 

( £ - c r y p t o i a n t h i n ) £gT-tf74 t D->f-> ( z e a i a n t n 

in) ^^tsiiffiiti?^ immmi ( 3) ^ mi 

1 B # JH ) o $ 6 K x nil it ? M #1 ti > 4 - >rh - 0 
- W * y > W (4-keto- £-ionone ring) ID 3 ffiO^f 

£#Wtt'0i©loj&<> * > * * If > f- > ( c a n t h a x a n t h i 
n ) HIi LT7 i - 3 * If > > (phoenicoianthi 
n ) 4gT.77Hf >f> (astaxanthin ) £ & $ t 
£B£3tffi&T-£S ( HSfe #J 6 . $110111) , 

K 1" S D N A tt*»&ftTlr*ttfrofcfc©"C**o 
£ > Agrobacterium © CrtZ U > 7 5 ; 1SW M A 
7? x E r w i n i a u r e d o v o r a © CrtZ i 51% O 7 f > r 
7-4 - t ^ 9f|S^*t D JJ-^iLfco 

(3) 'j3f>l«lil£f<OBf 
p A K 9 6 V> T 2. 9 kb Bam HIffr)trt©£fffc&3Kpnl 

Itt^^lOlli-^ll^ilT- ( P A K 9 6 K) . $ & IC « 
C © Kpn I «ffiJ:i>:&R:*S P s 1 1 SB ffi >&> b £ ffi'J © Sr tf* 
£&V>*:»r>t (pAKM ) KtS^Xlt* 0- il o * > t> 
7***-fr>f-^l*&$T-£S;&<(!!jfe0lJ6) N «J =» t° 
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So -2fx 2.9 kbJUmHIKr © 4» ic 1 gfr # £ L ± E © 
Kpn I KftfttZ Sal I Sfi^^flOK^ 

£#tfl. 6 k b _S_a_i I Br)T-(pAI98)*U3fcr;/g£rt» 
I I: i A t S ^ 0 - ^^^j.^.^ 
(HiW 7 ( 4) ) o I tz ifi r> X ^ Z © 1 . 6 k b S a 1 I ft 

# # £ L > : 1^ ± IB © Kpn I £ Ps t I $ ft 

ifc^oTSttLTl'Sitifitli, S E ?'J & £ © 
g ^ CffiStS> H *& a K > o B W fc U # 

V-Ai^«fi*tt8 lo(D*-^> • 'J — x ^ y y 

& fc? £ ft % I t: o C© c r t Y it fc^<Z>*fi»EyiJ£=i- 

■K*ft5#'J^?*K07$y»EaittS&3~4BI ( E 
«Sf 3) i:*^Tl>5, 

ft £ x d © Agrobacterinm a n r a n t i a c a s s p . n o v . M K 1 
© crtY jt T g ® (Crt Y) li, 7 5 y » E JIJ U /< 
^ T > E r w i n i a a r e d o ? o r a © CrtY <£ 4 4 . 3 % © 7 x > 

H « ffil 9 SOgSfeftDHA £ © »1f > # #f 

ffi©»#a^tt©ftfeft±K:#|jtLfc c_r_tW £ c_r_t_Z t *g 

a * •« t«i**<»&ns^s^*ti8*tfco n * « i 
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•gUi&Lfc Alcaligenes sp, PC-l^ Alteromonas sp. SD 
- 4 0 2 ©&fe#DNA £ j<J m M m B a m H 1 J: P s t I T- fg 

ft L . 7^n-xf;HIMS^HU, £ T © # 
SfL^DNA ®r Jt £ 0. 5 H N a 0 H % 1.5 H H i C 1 © 7 ^ * »J * 
& T g L tf: & > -Rfrtf Tt-f n>^>7U>Kh5 
>X7r-$-a-/io D N A i)< mW I it i- << n > t > zf V > 
$: s\ 4 7 V jT 4 V - 9 s w > i&Mt ( 6 1 D e n h a r d U 5 x S S C > 
100 v g/m I s sDN A) ic g L s 2 HQ n 5 0t "C 7 1/ ^ >f 7 
ij^-f^-S/a>*.ffttofco & K % p A K 9 6 K ^ 6 Ba I I 
-etat) tti Lfc crtW £ citZ £^frl.5 k b © D N A »f ^ M e g a 
prineTM DNA labelling systems ( 7 ^? + A ) £ [ a - 3 
2P] dCTP (~ 1 1 OTBq/mmo 1) ifcJBHTSittft L n ± IS © "7 
I/a^7'J ^ -T -bf - v/ a JP;iT16B#faij,, 6 0 °C T* 

/W7U^-f-tf-$'«2'!fc% 2 x S S C x 0 . 1 X S D S T 6 0 1 x 

7Ts t *s 7 h * & Lfcg^ N Alcaligenes s p . P C - 1 T* 
it Bam H 1 $j ft $3 $1 1 3 k b N Pst I M ft ® T 2. 3 5 k b 
© fit 1 tc ^ i/ J- Jl i)< £> ft x A I t e r ornan a s s p . S D - 4 
02T- li JUmHI ?Bftft1?ft5. 6 kb N _Ps_U flg ft « T? 2 0 k 
b SI ± © (ft E 3S ^ i/ ^ * ;b *< % b ft o 
UIJJJ :ffe©?fe^ifflM^t)©^^>h7>f;i/^^jg 

£?%©% 

HJjB^J9©M^«k^N Alcaligenes s p . P C - 1 © % fe # 
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D N A <D P s t I ffi \t X ^ 2.35 k b ft 35. \z Agrobacterium 
acraat iacus s p. no v. M K 1 © c r t ff £ tjAl & =p £ £ fr p H 

A Brfri/N^'J^-fXt«fi«^$tttKi^<)^ 
A 1 ca 1 i gene s © % fe ft DNA £ Ps t 1 T? "/8 ft L 
fe^N 2 ~ 3 . 5 k b ©f"'fX©DNA «rJt*7#n-*y/i/ 
€m^cffi*{-«k!9l5liRL^o 0 iR L rt: D N A »t Jt £ ^ * * 

— pBlnescript II S K + © P s t I 05 fir {C T 4 D N A U # - -t? £ 
ffl^TlAU *H 1 DH5 a i: I A L t , Alcaligenes 

£ > Agrobacterium © crtW j: c_rj_2 3t ? £ £ 1 . Skb © 
DNA lit ^ ^ d - y i L fc 3 n z - a ^ y ij y ^ t' - •> 
3 > {fc L fc IS & ^ ^ 5 0 0 0 =3 o--jf>&*s>x<{ 7 3 
° - - £ 1 o ¥ III L 0 ft % 3 n i - a ^ 7 'J y 'f t' 

- 3 > © £ #f* % mfl 9 Ifctlf >^Dfft i 
& # T ft o ^ T x lbtlfc3Di-^^77; 
KDNA *J**L, _Pj_tI I ft L t , 0&3:*i*:DNA ff 

Ullii^t llT^K i*^^^ 0 KT> 3 o 

© H « DMA «rfr®**&£ffilOlc*lfclJ--!r>afff 
ST^'f ^'J^-fXtil. 35 kb Bifrlo^#IL> 7 
# d - xy*ISl*ft4*ffll»T:0!. Ji kb P s 1 1 m 
£ IeI iK L n pBlnescript II SK + © P s t 1 gff fir ic $ 

A L T n 7 7 x S KpPCll . p P C 1 2 £ft!i!l*: 0 pPCll 
£pPC12 iz H T n ± 15 2.3 5 k b P s t I »f W i)< p B I n e s 
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cript II SK + © P s t I m ffi fc £ ^ fl» |SI § IC if A $ tl tz 
Ut^o pPCH ©$JI8&ft*&BI£gl 9HKiSt, 
H % flJ 1 1 : Alcaligenes ± g > h 7 4 

pPCll RtfpPCU £ £ - *nf >l4^1ii:IAL 
tz t C 5 > l5#«7^^*U->f->^©^lS ( H % 01 1 
2) lwJ;t)efe3ozi-fi:ii^fc # „ ft ff fc X 
©£fifct6*-5x.ttfr-»fco Lfc*<oT, "T 9 5 X 5 KpPCl 
1 fc:fcttS7X**tf>*:'£Jifc*fc*f*©;£fi'M:'<$' 
* - 1 a c 7v*-fi-<Z>m%tmCt:&Zt%x.l>tltz 0 
tfz, pPCll *'j3e>S4*liC*AlTi, *rfc 
<E6ilil4$ti4*>ofc0T, P P C 1 1 »c ti «; a tr > » 
ft * * a ? « £ * ft r t,* £ # 6 * o £ o 

pPCll © ff A Br Jt © 9 $ £ AS © 0. 72 kb B s t E 1 I - Eco 
RV»rJt£lI£ig<lF#*:77XS K C p P C 1 7 £ ft € N mi 9 
EI) * jS - *nf >i4^Hi;|Alt^ pPCll * 
3*ALfc»££H»l::> + 
t>f>^*^itSCi*<<3^^fcfcJ6 ( II 1 2 ) 
> C CD p P C 1 7 IC # S 1.6 3 kb P s t 1- B s t E 1 I Br Jt © & 
gBE?iJ©&££fro*:o 

* * g H # ft S¥ fi p P C 1 7 i pPCl 2 ^ffll^Ts £i T © ¥ 

jm-p^ofco p p c 1 7 * p p c 1 2 » * ft ft 1 o # g £ _k 

Ml t HU d I I I t tt K p n 1 £ _E_c_o R li L fcl> 

7 x a/^ n o*;i/Ajtflj4if i ^ ; - SS 
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}C £ «9 D N A £ IhI tf£ L rt: 0 tl tl © DNA £10 0 n 1 ©Ex 
o I I l'< v 7 7 - (50 is M Tris-HCI, 100 m M N a C 1 , 5 mM 
MgC 1 2. 10 mM 2- U*^ hx^y-A, p H 8 . 0 ) K & 
^ L \ 180 h©EioIIl5?U7--t?£;fjn/tT37 , t 

•c-^zaL^o l ft z t \z i o fi i s^^^'J^^lt^ 2 

t>^^fo, 2 0 A 1 © MB'< y 7 7 - (40 mM Na-Ace 
tate, 100 mM NaCl, 2 mM Z n C 1 2 , 10 % 7 'J -fe n - Jl , 
pH'4.5 ) 0A^fc*i2)f i-^ClLfc. if" > 7° U > 
^TI^ *f t> ti fz 5 * © f- * - 7 £ 65°C ^ 10£fflft& 
LT#3R*£g3#;fcfcx 5 -x - v N ? > / h* - > ? 

*l/7-tf*Jn*T37 , C-e30aiHlfcflLfco SJftgU 7 
^n-^y;l/M^^cgf{c«k?) > 1 o © 7° 5 x 5 K i: o u 
T10a«D**i*n*$c©g**«ftttSDIA Bf * E « L 
o HIJKL^DNA {iKlenow fragment KJ:*)5fc$&£2£M 
ItU 1 6 °C . -ft7-fT - a >SJSlfc^ x *8§@JM 
109 £ ^ fi $5 & L £ o fc&ftfca^rO^o-vfco^T 
^ ;U a? - 7 7-$>M13K07*EHTl**DHA i&iUU 

7 7 y J V '< 4 it 9 S X T J* ( <* ) OS7fe^7-f 7-D--f 

? ;P •> - ? x > * * y h£E^T'>-*x:/*Kl6;*fT 
^ > § » •> - ^ i > * - * M ^ T DM A E ?iJ £ & £ L 0 
*©&JimS>*l*:1631i&g*f (bp) 5> * D N A £ ?ij 
^11 6-1 ( IB ^1 If 7) {c^-fo §S&a K>® 

ffllcU#v-A|S^*ffi*<#«t**-7'> • U - x ^ 
7U-A©&3t©|±m> 2l© + t> h 7 4 ;b £ 
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$ it & =? c r tw , cj_t_Z © ( H ifiiaj 1 

3) - FL?5 2oCt-^> 

iJ-f-f>^7l/-A (116-1 8 0* A 6 B ( £ 

?|J g -f 7 © £ g ft g 9 9 ~ 8 2 4 K C ^ & D ( £ g ft g 8 2 4 
- 1 3 0 9 ) i)< JL tti $ ft fc 0 

II % flj 1 2 : A 1 c a I i g e n e s * » > h 7 ^ ;U £ 5g it & ? 

(1) 7X**f->?->> 4 - yh^7 + 1f>^>0|5| 
C 

SSiiflliiTftaifcpPcn o-ru-i/a^^^x^ k 

© O t> x £ flj © Bst E II Jfii^Il 7 H © £ S # 1 1 
62 (E?'J#-§7©:£gftfill62) 3T*£L*:xL'-v' 

k ( c r tw © h £ ft 3 ) *pPcn-3 (mi 

9 EI ) <!: £ tf *: 0 

Jl 7 (1) ) * A L fc t> © (^11 (pACCAR25A crtX^ 
pPCH-3)) («fe*SLT^«) £ 150 £g /mlOAp 
£30 m / m I © C id £ £ tf 2 Y T Jg Jffi 2 U v MH* > 3 7 1 > 
1 8 B# fel £ S L o ^iti^iILti»*> 300 ■ 1 

or* h >cit)aiiiiLfc. z. ti z m.m'& ^ 200 mi © 
9n<a*A,j*/**j-b (9 / 1 ) X- 2 mmm i ^ m 

/ - ;u (9 / 1 ) ic & «? & , ***tt«©£BiflJ'>y* 
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(15/1 ) T?g§B"**SC£K:J:!J % i|?D7 ^7 7 
4 - (TLC ) Zft r> fz 0 x © ® 3t {* N C © T L C lc «fc 
t)s Rf m 0. 5 4 (&25%) % 0. 72 (fa 30%) > 0. 91 
( fa 2 5 % ) O 3 7# 7 h K^^nfc, Rl IS 0.54 £ 
0.72 ©fe^^ TLC 7 L/ - h ^ £ N S i /Jn* 
©?nn*;i/A/^*y-;u (9 / \ ) li ^ ; - 
^ I: g i L > t 7 7 f^LB-2D #5A^nv h 5 7 
-f - (15 X 300 mm) ,» tuu&hk/tfj- 

>\> (9 / 1 ) *^tt>^y-;UTJg§§^ait-5Ci{w«fc 

!9> & * © frfc £ £ fa 1 Hp'oit 0 

CtibOftfC^mf^-prix^^ fd-MS 
x ^ ? h;b N <k s •> U * y JH t C © & Ifr g ( * □ o 

* Jl A / / ? J - )l (15/1 ) T-MBi) tf< > 4-^ K 
7 * if > f- > N & N 7X^ + f >f >®i$ D ° ( £ g 
M6, 7(1) ) £t-?T-ffcL*:)tJ6 N c ft S> © ft Jf £ 
4 - h M 7 * * > f- > (Rf 0.54)sRffs7X** 

* > f- > (Rf 0.72) £E£Lfc 0 Rf fit 0.91 
©fe^{i^>^+^>f->T-*ofc ( H jfe 0J 1 2 (2) ) o 

*tL^ • j PC 11 *fctt|PC17 £ jS - *nf 
JMlOl I:|AIU«) (*8§® (pACCARl 6 A c r tX N pPCl 
1 SfcttpPCll)) (SfiSlLTl^) t,7X^f> 
4 - *r Y M 7 >f >^ 
I4t5 : i^lSO^Iff I: J; OJtSSftfco 3 6 in x 
C ft £ (i N $»©7x^:i*1f->f->££gt-<5C£j&< 
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(2) ft > * * V > * > <D mfe 

pPCM-3 £ 0 - * n f- > l^^if J U 1 0 1 fcaliAl*: 
*> © (*B§@ (pACCARl 6 A c r t K pPC17-3)) ( M fe £ M 
L T ^ 5 ) £ 150 jug / m 1 ® Ap £ 30 Hi /ml©Cm£ 
£t?2YT igifc2 »J v Y 37°C N 1 8 B# HI £ S I fz 0 % 

mmfrt>mmit:w#z^inm\ 0 7 t h > i: j; 5 a 

ffiLfco C *l £ 21 SI & ^ 200 ml CD*DD*;l/A/** 
V - ;l/ (9 / l ) T* 2 @ & Hi I . ^ & @ L 0 $ £ £ . 
:n^/hi©Mn^;U/^;-;i/ (9 / \ ) icfg 

^ T x ?nn*;t,A/ < **y-;i, ( 5 0 / 1 ) t-Ilt« 

Ii?n7^77^ - (TLC ) £*T^fco 
ttfefe*£# ® 40K K II 3 * * * t>-» V>ft»* % TLC 7 
l^-h*>bx fr^io.fco $ b C'M© ^ n d * ;U A / 

y ^ > - ;i/ (9 / l ) *fctt^oo*^A/y^y-^ 

(l/l) l= ®M&* t7 7f ^^A^n^h^ 
574 - (15 X 300 mi) tc ^ # > *pd*;UA/.X£ 
y - ;b (9 / 1 ) tfcli^nn*;^/^;-;Kl/l) 
Tl§ISHlt5Cil:J;t) N $6 nn £ 2 n ife, 

*fef i:fitt5i^. - Rlix^^ FD-MS x ^ * 

h ;P ( 0 / e 5 6 4 ) n is <fc N U * y ;H L C © & ft ft 
dd^;^/^;-^ (50/1 ) T g §9 ) # > * 
> * * If > f- > <D m m n°c (BASFttgi) tt^t-su 
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ma s t ^ - rts^^^ h/K s/ y * y ^ tl c © & a g 

(^Dn**A/^;-A (50/1 ) T- jg §§ ) » 43 <fc 
& s J '< >« v 9 HR 6 ft C 1 8 (3.9 I 300 ma) ( £ t - * 
- X £8) £jgt'*:IPLC0&ttft (7t ^FU/y 
^-*/I-?b/<;-A (90/6 / 4 ) g B§ ) (H 
S£ 0J 7 (2) # J& ) J:!)x^^; >tS5i#il^fco 
tt n &ft£ftfcfe3l©g!3©lO%li*£lL;©0- * a 

f > t * O o 

(3) MT^^ry^yfDn'S. 
pPCll tc tf £ 1.15 kb Sai l #r £ p P C 1 1 i H t J 
|S]{CpBluescript II SK + ffl Sall^fir |: iA L ^ 7 Z ; 
. K ( p P C 1 3 £ # £ ^ ^19H#I) £ ft 8! I £ 0 

C © p P C 1 3 * 0 - *n^>^^^li®]M101 g A L 
f><0 (*B§® (pACCARl 6 A c r tX> pPC13)) ( Jt fe £ £ 
L T l> 5 ) £ 150 is / m I © Ap £ 3 0 i g / m I © Cm£ 
# tf 2 T T « 2 U y 3 TCU 1 8 B# jg § L £ 0 « 

300 ml © 7 -tr h > <fc #4 
tBLfco *l £ M $! & x 200 ml ©*nn*;UA/y* 
J - * (9 / 1 ) T* 2 Ie! & Hi I ^ i^gilfc 0 $t>{C> 
C ft £ /J\ fi © ? a n * ;l/ A / ^ ^ y - jl (9 / i ) f- $ 

^ T > ^DD^^A/y^y-;!/ (9 / 1 ) TJgP&^-a 
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pPCll £ 2: ft -5 2 . 3 5 k b © P % t I #r © ? "5 n 0 - 

jfcflO© 1.6 3 kb Ps t 1- B s t E I I ft W (pPCl? > 
* 1 90) Otfi:tiftT^5Ct^Iif?!Il Utf l 
2(1) ©»*«fct)iPI*"e**o Lfti«iT» 0.7 

2 kb Bs t EI 1- Ps t I IfrttifSft C©pPC17 © 1.6 

3 kb Ps t 1- Bs t EI 1 Br M" © * IC 1 4r 9f © _S_mj. ! Ifti 1 
y flfrOJjJI S5ffii<#4t5 (mi 90) o C © Sma I 
fe J: Sai l $#©£©0.65 kb fc«ktf0.69 kb © % Jt 
ttDIU, I* f ft t> hg^AB£^?£tt#ftf<*£<5 
C i li> :tl&Ofl/--> 3 >^77; K £ £ - # o ^ 
>i4*lii:*ALfct)0*fflHfcfi*^|ffi:J: & fig 
B L fc „ £ > 1.63 kb P s t 1 - B s t E 1 1 0f fx © £ ffl © 0 
.69 kb P s t 1 - Sai l BfrJt&pBluescript SK+ © P s t I - S 
111 A I 7X= Ktt^r MVASXISjli 

C © :/ 5 $ K £ £ - # n ?• > g £ * 

■ ■fc*ALfcfc©*J§^fc6*atlrfc:J;DWBLfc 0 
- £ > £ fflj © B s t E 1 1 J^^^Il 7SciS|f in 

2 (E?iJ#^7©£S&gll62) iT'^^ Lfcrlz-v'' 

a > 7 5 X S K pPCll-3 (^19 0) ICIU MlAg 

IStti^ltKi 1 2 ( 1 ) { 2 ) ) 

pPCU-3 © 1 1 6 2 b p 10r Ji" ft £ - *of >ifcmf7 

*1f->^>£g®£ Lt* > ? * V > ?■ > Z t: li. 7 Z * 
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S m a I U®. t Sai l «tt*<ff « L t ^ 5 i *i & ft 5 « 

> o fi'w e y # v - a is ^ « * s * - ^ > • u - 

fO^'7L'-A^ttUt5:i^tnO'C> Cft 
c r tW it ? £ ift € L o C <P c r tW i&&=? © tt£ £ 
Jijia - K $ ft * # U ^ 7 * K © 7 * y M E 3*1 tt * 1 3 

-140 ( e ?"J § -t 5 ) t^nti^o 

C © At ca 1 i genes sp. PC-1© crtW it fe ? g ftf (CrtW) 
Ji > £ - >f * y > 8 (yS-ionone ring) ® 4 ffi © f- U 

> (methylene ) 15^ (keto) £lC|£SI1-Si£3gS 
ttHLT* 5n t<D|#tt4«<Dloi«, 0 - -h u f- 

> C ^ -carotene ) ^Ili Ltx+^; > (echineno 
ne) £gT#>**1i->f-> ( c a n t h a i a n t h i n ) £ £ $ 
t2llgtt?*S ( 2) . mi 1 B#JR) 
o $ b Iw ^ cjjjit fe?^^«> 3 - t K o * 5/ - $ - << 
t J y M C3-hydroxy->9 - ionone ring) (D 4 fi© ^ f 1/ 
y (methylene ) m * >r h (keto) %\z%k&-?2>Wm% 
tttfLtfe!3> *<D*ft»««<!)lo^ N -tf 7 * > 
* > Oeaianthin) ^IIiU4 - h 7 * t V f 

> ( 4-k e t o z e a x an t h i n) £ g T 7 X * * -9" > f- > (asta 
xanthin ) i^lt^lgttt'U ( & 16 01 1 2 ( 1 

) , % I 1 B # A3 ) o « > CO J: ^ tti*fitt*tt 
3 # U ^ 7 f- K <fc C ft £ 3 - K t 3 DN A £ t* x ft * 
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to t> n t ^ « ^ o fc t 0 f * ^ :o#H^f Fifc 

£ 0 «k 9 * U ^ 7 f- K 3 (* DNA ft£*>£#fi«jtt** 
o ?-(ifLT^5^o & > Agrobacterinm a n r a n t i a 
cbs s p. 1107. MK 1 t Alcal igenes sp. P C — 1 Fa! <D e r t W it 
fc^m® ( CrtW) (i. 83 % <D 

7 4 9 y t 4 t 4 - il^Sl^to^-ifLTt^ 

0 * -C > 7 r > tt .4 T 4 - Sf ttlM!«0iS«07 
W ft E ?<J tt s »*OftttK:*nsir»STt4tt^i* 

1 < 6 0ft07 Wlill, & £ t,Mi > * * fc ti 
ffe©7 $ yft©tfjD*ftoT4>»*jgttfc^fftt#;ttt 

4&©;<f-P>g£fi&^l>gKgglt*,e- ^ ;* y > 
* ft It 3 - k K n * 2/ - /8 - >Sy HIAII 

C >S - ionone or 3-hydroiy- /S-ionone ri.ng ketolase ) 
*3-FU^Ul»Kii<?i«, tt 43 x iff - *f * 
y>3^^3 - k K d * 5/ - j8 - >«pR6f , 1 

(2) 7KftS« ABPfjt&fcT© ft£ 
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0 - ii a ? > k> 7 Z * V > ?■ y <D & ft \z &m ft t 
~c X © it & =¥■ it > pPC17 © 1.63 kb Ps t 1- B s t B 1 I Br Jt 
(11 9 El) <D + i:^*nT^5, Z © p P C 1 7 © 1.63 
kb Mi l- B s t E I 1 tSfH-OfrfclfrBft© Sai l 
t5o C©_S_aH «ffi®£ffl®Brttfc*eSi*AS3RS 

lillJ. #11 2 ( 3) © *_ 31 <fc «9 BJ§ 
£ * o I fz o T > TK^g^A^^itfe^li. 1.63 kb 
Pstl- Bst EIl Br it ® 9 t> £ ffl'J ® Br W X & S 0.94 kb_j3 

il i -_bjj_e ii mfrK&&tzzti)<frfrz c &mmm& 
£ © is m , deathless*** &a & n k>oiiic 

j_LL.it &=? t $ % I rt: o C® c r n ig fe^O^gE^iJia 

- Fns^H^f K © 7 5 ;il?ijB$l 5 BO. ( E 
5JSf 6) I: ^ § ti T. U 3 o 

C g> Alcaligenes sp. PC-1® crtZ itfe?!!^? (CrtZ) 
It ^ £ - 4 * J y M ( £ - i onone ring) © 3 ft © $ IC 

i-o®*mmzttijatzwmm&zGiTi3<o^ * © n 

# ft £ © 1 o # N £ - n * > C -carotene ) £ g' 
Ii LT i8 - ^ 'J 7 1 h ^ t > f > (£ -crypt oian thin) 

* MX H 7 * V > f > C leaxanthin) * _r fig 1" 3 g£ ft £ 

(Xtfll 2 { 3) ^ % 1 1 0 # ,1 ) o $ 6 > 

c r tZ it T H $J tt > 4 - * h - £ - ^ # y > ^ (4-keto 

- /8-ionone ring) O 3ffi©KSi: loCTkil'&ltllll 



-71- 



WO 95/18220 



PCT/JP94/02220 



t^it gtt^iLTi-^ z <d m # ft ti m <d i & , 

i] y 9 * V y * y (canthaianthin ) LT7i 
3 y f- y ( p h o c n i c o i a n t h i n ) ^StT^^ + t 
y * y (astaianthin ) £ & J&t Z & & T' * * (M 

i« i 2 t i) , % i i si # m ) o £^ m ^ © b 3t s 

ti N lia^nTl^^ofctOt'ifeSo 3: rt: N A I c a 1 
i gene s sp. PC-1 © © CrtZ fi x 7;;iE^Jl/K;K> 
E r t i n i a aredovora © CrtZ i SIX O 7 f > f -f f -f 
-i^5il*^**o-?-*jRLfco ft x Ag ro b a c t 
e r i a m a a r a n t i a c a s s p. nov. MK 1 £ Alcaligenes sp. P 
C-l © c_r_t_Zit fc^ Ife ( CrtZ ) {i N 7 5 V RE fl| 

^ / </^t^ SD%©7'f7>7-f f ^ - i^9ll^tD 

->* - £ W L T *i *) > S^©^tlfeM^T'|Hl-T*5o C 
*i £ © 7 $ 7 $ 8S ?'J © * T x 7^Tx>x^ x-f - 

* vm o % © $ g © 7 $ j m m n i* > **©*nic*na- 
fii 8 jc * ^ t tt x 4>l< &^ft©7s;iii», * s 

IM* , * £ > *^(iflfe©75y^©#Jn£fToTfe#S 

(3) -?4 + -iz*c'*y t y < Ji&&j&m&®^m 

fit ft ^ £ 8B S Erwinia ^ 3fc £ $ $3 S Rhodofaacter © * 
n^y>T K^J&itfei^JB^fcg*©^^!;:.);^ — jft 
, ^o^y^ K £ & fifc B * t* * If iU*of ^ 
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fc o fc i x- \t n Ervini a CD 'J 3 f > Sffcgf Ifi? 7 * 
5 c_r_m* 'J 3 ^HC*^f^*B»tTIftt5. 
L ifi r> X s Rhodobac te r <D 7 >r hxyflff-A^-'tf' 
it a ? c r 1 1 £ JS l> * C £ J; *9 'J 3 t* !) I: ; >f 

nx#l/>£:*;I§Sl;:&J&£-frN C tlfc E r w i a i a <P c r t Y 

£ # £ ft L 0 - .^7*of ^Iit2 (Linden, 
H. , M i s a w a , N. , Chamovita, D. , Pecker, I., Hirschb 
erg, J., Sandmann, G. , 'Functional complementation 
in Escherichia coli of different phytoene desator 
ase genes and analysis of accumulated carotenes'. 
2. Naturforsch. . 46c, p. 1 0 4 5 - 1 0 5 1 , 1 9 9 1 ) o t fz ^ 
* IS IB lw V* T % 0-- *> ° * > * * 7 * * > * > K Cr 
tW£ ttffl £ •£ £ £ ^ lo<r Mi«$ASnfci** 

of>^*>^^t>f>i:CitZ*ftffl$1r5i> * f 

So I K. & i T ^ fz t X. M x E r t i n i a O c r t E ^ c r t B y c_r_ 
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: 1 
: 6 3 9 

ra©M : mom : 

i^iJOTO : Genomic DNA 
jg® : 

: Agrobacterium anrantiacns 
Jfcg : sp. nov. MK1 

GTG CAT GCG CTG TGG TTT CTG GAC GCA GCG GCG CAT CCC ATC CTG GCG 48 
Met His Ala Leu Trp Phc Leu Asp Ala Ala Ala His Pro lie Leu Ala 

15 10 15 

ATC GCA AAT TTC CTG GGG CTG ACC TGG CTG KG GTC GGA TTG TTC ATC 96 
He Ala Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe lie 

20 25 30 

ATC GCG CAT GAC GCG ATG CAC GGG TCG GTG GTG CCG GGG CGT CCG CGC 144 
He Ala His Asp Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg 

35 40 45 

GCC AAT GCG GCG ATG GGC CAG CTT GTC CTG TGG CTG TAT GCC GGA TTT 192 
Ala Asn Ala Ala Met Gly Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe 
50 55 60 
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TCG TGG CGC AAG ATG ATC GTC AAG CAC ATG GCC CAT CAC CGC CAT GCC 240 

Ser Trp Arg Lys Met lie Val Lys His Met Ah His His Arg His Ala 
65 70 75 80 

GGA ACC GAC GAC GAC CCC GAT TTC GAC CAT GGC GGC CCG GTC CGC TGG 288 

Gly Thr Asp Asp Asp Pro Asp Phe Asp His Gly Gly Pro Val Arg Trp 

85 90 95 

TAC GCC CGC TTC ATC GGC ACC TAT TTC GGC TGG CGC GAG GGG CTG CTG 336 

Tyr Ala Arg Phe lie Gly Thr Tyr Phe Gly Trp Arg Glu Gly Leu Leu 

100 105 110 

CTG CCC GTC ATC GTG ACG GTC TAT GCG CTG ATC CTT GGG GAT CGC TGG 384 

Leu Pro Val He Val Thr Val Tyr Ala Leu lie Leu Gly Asp Arg Trp 

115 120 125 

ATG TAC GTG GTC TTC TGG CCG CTG CCG TCG ATC CTG GCG TCG ATC CAG 432 

Met Tyr Val Val Phe Trp Pro Leu Pro Ser He Leu Ala Ser He Gin 

130 135 140 

CTG TTC GTG TTC GGC ACC TGG CTG CCG CAC CGC CCC GGC CAC GAC GCG 480 

Leu Phe Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala 
145 150 155 160 

TTC CCG GAC CGC CAC AAT GCG CGG TCG TCG CGG ATC AGC GAC CCC GTG 528 

Phe Pro Asp Arg His Asn Ala Arg Ser Ser Arg He Ser Asp Pro Val 

165 170 175 

TCG CTG CTG ACC TGC TTT CAC TTT GGC GGT TAT CAT CAC GAA CAC CAC 576 

Ser Leo Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His 

180 185 190 
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CTG CAC CCG ACG GTG CCG TGG TGG CGC CTG CCC AGC ACC CGC ACC AAG 624 

Leu His Pro Thr Val Pro Trp Trp Arg Leo Pro Ser Thr Arg Thr Lys 

195 200 20S 

GGG GAC ACC GCA TGA 639 

Gly Asp Tor Ala *** 
210 
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E?iJ©g$ : 4 8 9 

h#nv- -.mm 

EJijOgg : Genomic DNA 

mm-. 

: Agroba c t e r i am aorant iacns 
tfc& : sp. no?. MK1 

WM 

ATG ACC AAT TTC CTG ATC GTC GTC GCC ACC GTG CTG GTG ATG GAG TTG 48 
Met Thr Asn Phe Leo lie Val Val Ala Thr Val Leo Val Met Glo Leo 

.1 5 10 15 

ACG GCC TAT TCC GTC CAC CGC TGG ATC ATG CAC GGC CCC CTG GGC TGG 96 
Thr Ala Tyr Ser Val His kxg Trp He Met His Gly Pro Leo Gly Trp 

20 25 30 

GGC TGG CAC AAG TCC CAC CAC GAG GAA CAC GAC CAC GCG CTG GAA AAG 144 
Gly Trp His Lys Ser His His Glo Glo His Asp His Ala Leo Glu Lys 

35 40 45 

AAG GAC CTG TAC GGC CTG GTC TTT GCG GTG ATC GCC ACG GTG CTG TTC 192 
Aso Asp Leo Tyr Gly Leo Val Phe Ala Val lie Ala Tor Val Leo Pbe 
50 55 60 
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ACG GTG GGC TGG ATC TGG GCG CCG GTC CTG TGG TGG ATC GCC TTG GGC 240 
Thr Val Gly Trp lie Trp Ala Pro Val Leo Tip Trp lie Ala Leo Gly 
65 70 75 80 

ATG ACT GTC TAT GGG CTG ATC TAT TTC GTC CTG CAT GAC GGG CTG GTG 288 
Met Thr Val Tyr Gly Leo He Tyr Phe Vai Leo His Asp Gly Leo Yal 

85 90 95 

CAT CAG CGC TGG CCG TTC CGT TAT ATC CCG CGC AAG GGC TAT GCC AGA 336 
His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys Gly Tyr Ala Arg 

100 105 110 

CGC CTG TAT CAG GCC CAC CGC CTG CAC CAT GCG GTC GAG GGG CGC GAC 384 
Arg Leo Tyr Gin Ala His Arg Leo His His Ala Val Glo Gly Arg Asp 

115 120 125 

CAT TGC GTC AGC TTC GGC TTC ATC TAT GCG CCC CCG GTC GAC AAG CTG 432 
His Cys Val Ser Phe Gly Phe lie Tyr Ala Pro Pro Val Asp Lys Leo 

130 135 140 

AAG CAG GAC CTG AAG ATG TCG GGC GTG CTG CGG GCC GAG GCG CAG GAG 480 
Lys Gin Asp Leo Lys Met Ser Gly Val Leo Arg Ala Glo Ala Gin Glo 
145 150 155 160 

CGC ACG TGA 489 
Arg Thr — 
*** 
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EMS* : 3 

: 1 1 6 1 

raoM : mom. : 

12?|J©S8I: Genomic DNA 

: 

£#g : Agrobacterium aurantiacus 
: sp. nov. MK1 

W&\ 

GTG ACC CAT GAC GTG CTG CTG GCA GGG GCG GGC CTT GCC AAC GGG CTG 48 
Met Tbr His Asp Val Leu Leu Ala Gly Ala Gly Leu Ala Asn Gly Leu 

1 5 10 15 

ATC GCC CTG GCG CTG CGC GCG GCG CGG CCC GAC CTG CGC GTG CTG CTG 96 
He Ala Leu Ala Leu Arg A La Ala Arg Pro Asp Leu Arg Val Leu Leu 

20 25 30 

CTG GAC CAT GCC GCA GGA CCG TCA GAC GGC CAC ACC TGG TCC TGC CAC 144 
Leu Asp His Ala Ala Gly Pro Ser Asp Gly His Tbr Trp Ser Cys His 

35 40 45 

GAC CCC GAC CTG TCG CCG GAC TGG CTG GCG CGG CTG AAG CCC CTG CGC 192 
Asp Pro Asp Leu Ser Pro Asp Trp Leu Ala Arg Leu Lys Pro Leu Arg 
50 55 60 
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CGC GCC AAC TGG CCC GAC CAG GAG GTG CGC TTT CCC CGC CAT GCC CGG 240 
Arg Ala Asn Trp Pro Asp Gin Glo Val Arg Phe Pro Arg His Ala Arg 
65 70 75 80 

CGG CTG GCC ACC GGT TAC GGG TCG CTG GAC GGG GCG GCG CTG GCG GAT 288 
Arg Leu Ala Thr Gly Tyr Gly Ser Leu Asp Gly Ala Ala Leo Ala Asp 

85 90 95 

GCG GTG GTC CGG TCG GGC GCC GAG ATC CGC TGG GAC AGC GAC ATC GCC 336 
Ala Val Val Arg Ser Gly Ala Glo He Arg Trp Asp Ser Asp He Ala 

100 105 110 

CTG CTG GAT GCG CAG GGG GCG ACG CTG TCC TGC GGC ACC CGG ATC GAG 384 
Leo Leo Asp Ala Gin Gly Ala Thr Leo Ser Cys Gly Tor Arg He Glo 

115 120 125 

GCG GGC GCG GTC CTG GAC GGG CGG GGC GCG CAG CCG TCG CGG CAT CTG 432 
Ala Gly Ala Val Leo Asp Gly Arg Gly Ala Gin Pro Ser Arg His Leo 

130 135 140 

ACC GTG GGT TTC CAG AAA TTC GTG GGT GTC GAG ATC GAG ACC GAC CGC 480 
Thr Val Gly Phe Gin Lys Phe Val Gly Val Glo He Glo Thr Asp Arg 
145 150 155 . 160 

CCC CAC GGC GTG CCC CGC CCG ATG ATC ATG GAC GCG ACC GTC ACC CAG 528 ' 
Pro His Gly Val Pro Arg Pro Met He Met Asp Ala Thr Val Thr Gin 

165 170 175 

CAG GAC GGG TAC CGC TTC ATC TAT CTG CTG CCC TTC TCT CCG ACG CGC 576 
Gin Asp Gly Tyr Arg Phe He Tyr Leu Leo Pro Phe Ser Pro Thr Arg 
180 185 190 
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ATC CTG ATC GAG GAC ACG CGC TAT TCC GAT GGC GGC GAT CTG GAC GAC 624 

He Leu He Gin Asp Thr Arg Tyr Ser Asp Gly Gly Asp Leu Asp Asp 

195 200 205 

GAC GCG CTG GCG GCG GCG TCC CAC GAC TAT GCC CGC CAG CAG GGC TGG 672 

Asp Ala Leu Ah Ala Ala Ser His Asp Tyr Ala Arg Gin Gin Gly Trp 

210 215 220 

ACC GGG GCC GAG GTC CGG CGC GAA CGC GGC ATC CTT CCC ATC GCG CTG 720 

Thr Gly Ala Gin Val Arg Arg Glu Arg Gly lie Len Pro lie Ala Leu 

225 230 235 240 

GCC CAT CAT GCG GCG GGC TTC TGG GCC GAT CAC GCG GCG GGG CCT GTT 768 

Ala His Asp Ala Ala Gly Phe Trp Ala Asp His Ala Ala Gly Pro Val 

245 250 255 

CCC GTG GGA CTG CGC GCG GGG TTC TTT CAT CCG GTC ACC GGC TAT TCG 816 

Pro Val Gly Len Arg Ala Gly Phe Phe His Pro Val Thr Gly Tyr Ser 

260 265 270 

CTG CCC TAT GCG GCA CAG GTG GCG GAC GTG GTG GCG GGT CTG TCC GGG 864 

Leu Pro Tyr Ala Ala Gin Val Ala Asp Val Val Ala Gly Leu Ser Gly 

275 280 ' 285 

CCG CCC GGC ACC GAC GCG CTG CGC GGC GCC ATC CGC GAT TAC GCG ATC 912 

Pro Pro Gly Thr Asp Ala Leu Arg Gly Ala lie Arg Asp Tyr Ala He 

290 295 300 

GAC CGG GCG CGC CGC GAC CGC TTT CTG CGC CTT TTG AAC CGG ATG CTG 960 

Asp Arg Ala Arg Arg Asp Arg Phe Leu Arg Leu Leu Asn Arg Met Leu 

305 310 315 320 
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TTC CGC GGC TGC GCG CCC GAC CGG CGC TAT ACC CTG CTG CAG CGG TTC 1008 
Phe Arg Gly Cy s Ala Pro Asp Arg Arg Tyr Thr Leo Leu Gin Arg Pie 

325 330 335 

TAC CGC ATG CCG CAT GGA CTG ATC GAA CGG TTC TAT GCC GGC CGG CTG 1056 
Tyr Arg Met Pro His Gly Leo lie Glo Arg Phe Tyr Ala Gly Arg Leo 

340 345 350 

AGC GTG GCG GAT CAG CTG CGC ATC GTG ACC GGC AAG CCT CCC ATT CCC 1104 
Ser Val Ala Asp Gin Leo Arg He Val Thr Gly Lys Pro Pro He Pro 

355 360 365 

CTT GGC ACG GCC ATC CGC TGC CTG CCC GAA CGT CCC CTG CTG AAG GAA 1152 
Leo Gly Thr Ala He Arg Cys Leo Pro Glo Arg Pro Leo Leo Lys Glo 

370 375 380 

AAC GCA TGA 1161 
Asn Ala *** 
385 
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EWSf- : 4 
E?iJ©g$ : 2 8 8 6 

g^iJOg^ : Genomic DNA 

mm-. 

4#g : Agrobacterinm anrantiacns 
: sp. nor. MK1 

eh 

GGATCCGGCG ACCTTGCGGC GCTGCGCCGC GCGCCTTTGC TGGTGCCTGG GCCGGGTGGC 60 
CCTAGGCCGC TGGAACGCCG CGACGCGGCG CGCGGAAACG ACCACGGACC CGGCCCACCG 

CAATGGTCGC AAGCAACGGG GATGGAAACC GGCGATGCGG GACTGTAGTC TGCGCGGATC 120 
GTTACCAGCG TTCGTTGCCC CTA-CCTTTGG CCGCTACGCC CTGACATCAG ACGCGCCTAG 

GCCGGTCCGG GGGACAAGAT GAGCGCACAT GCCCTGCCCA AGGCAGATCT GACCGCCACC 180 
CGGCCAGGCC CCCTGTTCTA CTCGCGTGTA CGGGACGGGT TCCGTCTAGA CTGGCGGTGG 

AGCCTGATCG TCTCGGGCGG CATCATCGCC GCTTGGCTGG CCCTGCATGT GCATGCGCTG 240 
TCGGACTAGC AGAGCCCGCC GTAGTAGCGG CGAACCGACC GGGACGTACA CGTACGCGAC 

TGGTTTCTGG ACGCAGCGGC GCATCCCATC CTGGCGATCG CAAATTTCCT GGGGCTGACC 300 
ACCAAAGACC TGCGTCGCCG CGTAGGGTAG GACCGCTAGC GTTTAAAGGA CCCCGACTGG 
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TGGCTGTCGG TCGGATTGTT CATCATCGCG CATGACGCGA TGCACGGGTC GGTGGTGCCG 360 
ACCGACAGCC AGCCTAACAA GTAGTAGCGC GTACTGCGCT ACGTGCCCAG CCACCACGGC 

GGGCGTCCGC GCGCCAATGC GGCGATGGGC CAGCTTGTCC TGTGGCTGTA TGCCGGATTT 420 
CCCGCAGGCG CGCGGTTACG CCGCTACCCG GTCGAACAGG ACACCGACAT ACGGCCTAAA 

TCGTGGCGCA AGATGATCGT CAAGCACATG GCCCATCACC GCCATGCCGG AACCGACGAC 480 
AGCACCGCGT TCTACTAGCA GTTCGTGTAC CGGGTAGTGG CGGTACGGCC TTGGCTGCTG 

GACCCCGATT TCGACCATGG CGGCCCGGTC CGCTGGTACG CCCGCTTCAT CGGCACCTAT 540 
CTGGGGCTAA AGCTGGTACC GCCGGGCCAG GCGACCATGC GGGCGAAGTA GCCGTGGATA 

TTCGGCTGGC GGGAGGGGCT GCTGCTGCCC GTCATCGTGA CGGTCTATGC GCTGATCCTT 600 
AAGCCGACCG CGCTCCCCGA CGACGACGGG CAGTAGCACT GCCAGATACG CGACTAGGAA 

GGGGATCGCT GGATGTACGT GGTCTTCTGG CCGCTGCCGT CGATCCTGGC GTCGATCCAG 660 
CCCCTAGCGA CCTACATGCA CCAGAAGACC GGCGACGGCA GCTAGGACCG CAGCTAGGTC 

CTGTTCGTGT TCGGCACCTG GCTGCCGCAC CGCCCCGGCC ACGACGCGTT CCCGGACCGC 720 
GACAAGCACA AGCCGTGGAC CGACGGCGTG GCGGGGCCGG TGCTGCGCAA GGGCCTGGCG 

CACAATGCGC GGTCGTCGCG GATCAGCGAC CCCGTGTCGC TGCTGACCTG CTTTCACTTT 780 
GTGTTACGCG CCAGCAGCGC CTAGTCGCTG GGGCACAGCG ACGACTGGAC GAAAGTGAAA 
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GGCGGTTATC ATCACGAACA CCACCTGCAC CCGACGGTGC CGTGGTGGCG CCTGCCCAGC 840 
CCGCCAATAG TAGTGCTTGT GGTGGACGTG GGCTGCCACG GCACCACCGC GGACGGGTCG 

ACCCGCACCA AGGGGGACAC CGCATGACCA ATTTCCTGAT CGTCGTCGCC ACCGTGCTGG 900 
TGGGCGTGGT TCCCCCTGTG GCGTACTGGT TAAAGGACTA GCAGCAGCGG TGGCACGACC 

TGATGGAGTT GACGGCCTAT TCCGTCCACC GCTGGATCAT GCACGGCCCC CTGGGCTGGG 960 
ACTACCTCAA CTGCCGGATA AGGCAGGTGG CGACCTAGTA CGTGCCGGGG GACCCGACCC 

GCTGGCACAA GTCCCACCAC GAGGAACACG ACCACGCGCT GGAAAAGAAC GACCTGTACG 1020 
CGACCGTGTT CAGGGTGGTG CTCCTTGTGC TGGTGCGCGA CCTTTTCTTG CTGGACATGC 

GCCTGGTCTT TGCGGTGATC GCCACGGTGC TGTTCACGGT GGGCTGGATC TGGGCGCCGG 1080 
CGGACCAGAA ACGCCACTAG CGGTGCCACG ACAAGTGCCA CCCGACCTAG ACCCGCGGCC 

TCCTGTGGTG GATCGCCTTG GGCATGACTG TCTATGGGCT GATCTATTTC GTCCTGCATG 1140 
AGGACACCAC CTAGCGGAAC CCGTACTGAC AGATACCCGA CTAGATAAAG CAGGACGTAC 

ACGGGCTGGT GCATCAGCGC TGGCCGTTCC GTTATATCCC GCGCAAGGGC TATGCCAGAC 1200 
TGCCCGACCA CGTAGTCGCG ACCGGCAAGG CAATATAGGG CGCGTTCCCG ATACGGTCTG 

GCCTGTATCA GGCCCACCGC CTGCACCATG CGGTCGAGGG GCGCGACCAT TGCGTCAGCT 1260 
CGGACATAGT CCGGGTGGCG GACGTGGTAC GCCAGCTCCC CGCGCTGGTA ACGCAGTCGA 
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TCGGCTTCAT CTATGCGCCC CCGGTCGACA AGCTGAAGCA GGACCTGAAG ATGTCGGGCG 1320 
AGCCGAAGTA GATACGCGGG GGCCAGCTGT TCGACTTCGT CCTGGACTTC TACAGCCCGC 

TGCTGCGGGC CGAGGCGCAG GAGCGCACGT GACCCATGAC GTGCTGCTGG CAGGGGCGGG 1380 
ACGACGCCCG GCTCCGCGTC CTCGCGTGCA CTGGGTACTG CACGACGACC GTCCCCGCCC 

CCTTGCCAAC GGGCTGATCG CCCTGGCGCT GCGCGCGGCG CGGCCCGACC TGCGCGTGCT 1440 
GGAACGGTTG CCCGACTAGC GGGACCGCGA CGCGCGCCGC GCCGGGCTGG ACGCGCACGA 

GCTGCTGGAC CATGCCGCAG GACCGTCAGA CGGCCACACC TGGTCCTGCC ACGACCCCGA 1500 
CGACGACCTG GTACGGCGTC CTGGCAGTCT GCCGGTGTGG ACCAGGACGG TGCTGGGGCT 

CCTGTCGCCG GACTGGCTGG CGCGGCTGAA GCCCCTGCGC CGCGCCAACT GGCCCGACCA 1560 
GGACAGCGGC CTGACCGACC GCGCCGACTT CGGGGACGCG GCGCGGTTGA CCGGGCTGGT 

GGAGGTGCGC TTTCCCCGCC ATGCCCGGCG GCTGGCCACC GGTTACGGGT CGCTGGACGG 1620 
CCTCCACGCG AAAGGGGCGG TACGGGCCGC CGACCGGTGG CCAATGCCCA GCGACCTGCC 

GGCGGCGCTG GCGGATGCGG TGGTCCGGTC GGGCGCCGAG ATCCGCTGGG ACAGCGACAT 1680 
CCGCCGCGAC CGCCTACGCC ACCAGGCCAG CCCGCGGCTC TAGGCGACCC TGTCGCTGTA 

CGCCCTGCTG GATGCGCAGG GGGCGACGCT GTCCTGCGGC ACCCGGATCG AGGCGGGCGC 1740 
GCGGGACGAC CTACGCGTCC CCCGCTGCGA CAGGACGCCG TGGGCCTAGC TCCGCCCGCG 
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GGTCCTGGAC GGGCGGGGCG CGCAGCCGTC GCGGCATCTG ACCGTGGGTT TCCAGAAATT 1800 
CCAGGACCTG CCCGCCCCGC GCGTCGGCAG CGCCGTAGAC TGGCACCCAA AGGTCTTTAA 

CGTGGGTGTC GAGATCGAGA CCGACCGCCC CCACGGCGTG CCCCGCCCGA TGATCATGGA 1860 
GCACCCACAG CTCTAGCTCT GGCTGGCGGG GGTGCCGCAC GGGGCGGGCT ACTAGTACCT 

CGCGACCGTC ACCCAGCAGG ACGGGTACCG CTTCATCTAT CTGCTGCCCT TCTCTCCGAC 1920 
GCGCTGGCAG TGGGTCGTCC TGCCCATGGC GAAGTAGATA GACGACGGGA AGAGAGGCTG 

GCGCATCCTG ATCGAGGACA CGCGCTATTC CGATGGCGGC GATCTGGACG ACGACGCGCT 1980 
CGCGTAGGAC TAGCTCCTGT GCGCGATAAG GCTACCGCCG CTAGACCTGC TGCTGCGCGA 

GGCGGCGGCG TCCCACGACT ATGCCCGCCA GCAGGGCTGG ACCGGGGCCG AGGTCCGGCG 2040 
CCGCCGCCGC AGGGTGCTGA TACGGGCGGT CGTCCCGACC TGGCCCCGGC TCCAGGCCGC 

CGAACGCGGC ATCCTTCCCA TCGCGCTGGC CCATGATGCG GCGGGCTTCT GGGCCGATCA 2100 
GCTTGCGCCG TAGGAAGGGT AGCGCGACCG GGTACTACGC CGCCCGAAGA CCCGGCTAGT 

CGCGGCGGGG CCTGTTCCCG TGGGACTGCG CGCGGGGTTC TTTCATCCGG TCACCGGCTA 2160 
GCGCCGCCCC GGACAAGGGC ACCCTGACGC GCGCCCCAAG AAAGTAGGCC AGTGGCCGAT 

TTCGCTGCCC TATGCGGCAC AGGTGGCGGA CGTGGTGGCG GGTCTGTCCG GGCCGCCCGG 2220 
AAGCGACGGG ATACGCCGTG TCCACCGCCT GCACCACCGC CCAGACAGGC CCGGCGGGCC 
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CACCGACGCG CTGCGCGGCG CCATCCGCGA TTACGCGATC GACCGGGCGC GCCGCGACCG 2280 
GTGGCTGCGC GACGCGCCGC GGTAGGCGCT AATGCGCTAG CTGGCCCGCG CGGCGCTGGC 

CTTTCTGCGC CTTTTGAACC GGATGCTGTT CCGCGGCTGC GCGCCCGACC GGCGCTATAC 2340 
GAAAGACGCG GAAAACTTGG CCTACGACAA GGCGCCGACG CGCGGGCTGG CCGCGATATG 

CCTGCTGCAG CGGTTCTACC GCATGCCGCA TGGACTGATC GAACGGTTCT ATGCCGGCCG 2400 
GGACGACGTC GCCAAGATGG CGTACGGCGT ACCTGACTAG CTTGCCAAGA TACGGCCGGC 

GCTGAGCGTG GCGGATCAGC TGCGCATCGT GACCGGCAAG CCTCCCATTC CCCTTGGCAC 2460 
CGACTCGCAC CGCCTAGTCG ACGCGTAGCA CTGGCCGTTC GGAGGGTAAG GGGAACCGTG 

GGCCATCCGC TGCCTGCCCG AACGTCCCCT GCTGAAGGAA AACGCATGAA CGCCCATTCG 2520 
CCGGTAGGCG ACGGACGGGC TTGCAGGGGA CGACTTCCTT TTGCGTACTT GCGGGTAAGC 

CCCGCGGCCA AGACCGCCAT CGTGATCGGC GCAGGCTTTG GCGGGCTGGC CCTGGCCATC 2580 
GGGCGCCGGT TCTGGCGGTA GCACTAGCCG CGTCCGAAAC CGCCCGACCG GGACCGGTAG 

CGCCTGCAGT CCGCGGGCAT CGCCACCACC CTGGTCGAGG CCCGGGACAA GCCCGGCGGG 2640 
GCGGACGTCA GGCGCCCGTA GCGGTGGTGG GACCAGCTCC GGGCCCTGTT CGGGCCGCCC 

CGCGCCTATG TCTGGCACGA TCAGGGCCAT CTCTTCGACG CGGGCCCGAC CGTCATCACC 2700 
GCGCGGATAC AGACCGTGCT AGTCCCGGTA GAGAAGCTGC GCCCGGGCTG GCAGTAGTGG 
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GACCCCGATG CGCTGAAAGA GCTGTGGGCC 
CTGGGGCTAC GCGACTTTCT CGACACCCGG 

ACGCTGATGC CGGTCTCGCC CTTCTATCGG 
TGCGACTACG GCCAGAGCGG GAAGATAGCC 

TACGTGAACG AGGCCGATCC AGGGTCTGGG 
ATGCACTTGC TCCGGCTAGG TCCCAGACCC 

GGATCC 
CCTAGG 



CTGACCGGGC AGGACATGGC GCGCGACGTG 2760 
GACTGGCCCG TCCTGTACCG CGCGCTGCAC 

CTGATGTGGC CGGGCGGGAA GGTCTTCGAT 2820 
GACTACACCG GCCCGCCCTT CCAGAAGCTA 

TCTTGCCGTG CCAGGTGAAG CTGTTGCCGT 2880 
AGAACGGCAC GGTCCACTTC GACAACGGCA 

2886 
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wznm : 5 

E?'J©g£ : 7 2 9 

mm : &m = 

SE^'Jom : Genomic DNA 

mm-. 

%M%x : Alcal i gene s 
tfcg sp. PC-1 



ATG TCC GGA CGG AAG CCT GGC ACA ACT GGC GAC ACG ATC GTC AAT CTC 48 
Met Ser Gly Arg Lys Pro Gly Thr Thr Gly Asp Thr lie Val Asn Leu 

15 10 15 

GGT CTG ACC GCC GCG ATC CTG CTG TGC TGG CTG GTC CTG CAC GCC TTT 96 
Gly Leo Thr Ala Ala He Lea Leu Cys Trp Leu Val Leu His Ala Ph e 

20 25 30 

ACG CTA TGG TTG CTA GAT GCG GCC GCG CAT CCG CTG CTT GCC GTG CTG 144 
Thr Leu Trp Leu Leu Asp Ala Ala Ala His Pro Leu Leu Ala Val Leu 
35 40 45 

TGC CTG GCT GGG CTG ACC TGG CTG TCG GTC GGG CTG TTC ATC ATC GCG 192 
Cys Leu Ala Gly Leu Thr Trp Leu Ser Val Gly Leu Phe lie He Ala 
50 55 60 
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CAT GAC CCA ATG CAC GGG TCC GTG GTG CCG GGC CGG CCG CGC GCC AAT 240 

His Asp Als Met His Gly Ser Val Val Pro Gly Arg Pro Arg Ala Asa 
65 70 75 80 

GCG GCG ATC GGG CAA CTG GCG CTG TGG CTC TAT GCG GGG TTC TCG TGG 288 

Ala Ala lie Gly Gin Leu Ala Leu Trp Leo Tyr Ala Gly Pbe Ser Trp 

85 90 95 

CCC AAG CTG ATC GCC AAG CAC ATG ACG CAT CAC CGG CAC GCC GGC ACC 336 

Pro Lys Leu He Ala Lys His Met Thr His His Arg His Ala Gly Thr 

100 105 110 

GAC AAC GAT CCC GAT TTC GGT CAC GGA GGG CCC GTG CGC TGG TAC GGC 384 

Asp Asn Asp Pro Asp Phe Gly His Gly Gly Pro Val Arg Trp Tyr Gly 

115 120 125 

AGC TTC GTC TCC ACC TAT TTC GGC TGG CGA GAG GGA CTG CTG CTA CCG 432 

Ser Pbe Val Ser Thr Tyr Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro 

130 135 140 

GTG ATC GTC ACC ACC TAT GCG CTG ATC CTG GGC GAT CGC TGG ATG TAT 480 

Val He Val Thr Thr Tyr Ala Leu He Leu Gly Asp Arg Trp Met Tyr 
145 150 155 160 

GTC ATC TTC TGG CCG GTC CCG GCC GTT CTG GCG TCG ATC CAG ATT TTC 528 

Val He Phe Trp Pro Val Pro Ala Val Leu Ala Ser He Gin He Phe 

165 170 175 

GTC TTC GGA ACT TGG CTG CCC CAC CGC CCG GGA CAT GAC GAT TTT CCC 576 

Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Asp Phe Pro 

180 185 190 
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GAC CGG CAC AAC GCG AGG TCG ACC GGC ATC GGC GAC CCG TTG TCA CTA 624 
Asp Arg His Aso Ala Arg Ser Thr G 1 y He Gly Asp Pro Lea Ser Lea 

195 200 205 

CTG ACC TGC TTC CAT TTC GGC GGC TAT CAC CAC GAA CAT CAC CTG CAT 672 
Lea Thr Cys Phe His Phe Gly Gly Tyr His His Gla His His Lea His 

210 215 220 

CCG CAT GTG CCG TGG TGG CGC CTG CCT CGT ACA CGC AAG ACC GGA GGC 720 
Pro His Val Pro Trp Trp Arg Leo Pro Arg Thr Arg Lys Thr Gly Gly 
225 230 235 240 

CGC GCA TGA 729 
Arg Ala*** 
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E?»]#§ : 6 

g£?ij©g$ : 4 8 9 

m&ia>m : mow. : rxm 

EJiJCDggg : Genomic DNA 

: Alcaligenes 
m : sp. PC-1 



ATG ACG CAA TTC CTC ATT GTC GTG GCG ACA GTC CTC GTG ATG GAG CTG 48 

Met Thr Gin Pbe Leu lie Val Yal Ala Thr Val Leo Val Met Glu Leu 

5 10 IS 

ACC GCC TAT TCC GTC CAC CGC TGG ATT ATG CAC GGC CCC CTA GGC TGG 96 

Thr Ala Tyr Ser Yal His Arg Trp He Met Bis Gly Pro Leu Gly Trp 

20 25 30 

GGC TGG CAC AAG TCC CAT CAC GAA GAG CAC GAC CAC GCG TTG GAG AAG 144 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 

35 40 45 

AAC GAC CTC TAC GGC GTC GTC TTC GCG GTG CTG GCG ACG ATC CTC TTC 192 

Aso Asp Leu Tyr Gly Val Yal Pbe Ala Yal Leo Ala Thr lie Leu Pbe 

50 55 60 
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ACC GTG GGC GCC TAT TGG 
Thr Val Gly Ala Tyr Trp 
65 70 
ATG ACG GTC TAT GGG TTG 
Met Thr Val Tyr Gly Leo 
85 

CAT CAA CGC TGG CCG TTT 
His Gin Arg Trp Pro Phe 
100 

AGG CTC TAG CAA GCT CAT 
Arg Leu Tyr Gin Ala His 
115 

CAC TGC GTC AGC TTC GGC 
His Cys Val Ser Phe Gly 
130 

AAG CAG GAT CTG AAG CGG 
Lys Gin Asp Leo Lys Arg 
145 150 
CCG KG TGA 
Pro Ser m 



TGG CCG GTG 
Trp Pro Val 

ATC TAT TTC 
lie Tyr Phe 

CGG TAT ATT 
Arg Tyr He 
105 

CGC CTG CAC 
Arg Leu His 
120 

TTC ATC TAT 
Phe He Tyr 
1-35 

TCG GGT GTC 
Ser Gly Val 



CTG TGG 
Leo Trp 
75 

ATC CTG 
He Leu 



TGG ATC GCC 
Trp He Ala 

CAC GAC GGG 
His Asp Gly 



CCG CGG 
Pro Arg 

CAC GCG 
His Ala. 

GCC CCA 
Ala Pro 

CTG CGC 
Leo Arg 
155 



CGG GGC TAT 
Arg Gly Tyr 
110 

GTC GAG GGG 
Val Gin Gly 

125 
CCC GTG GAC 
Pro Val Asp 
140 

CCC CAG GAC 
Pro Gin Asp 



CTG GGC 
Leu Gly 
SO 

CTT GTG 
Len Val 
95 

TTC CGC 
Phe Arg 

CGG GAC 
Arg Asp 

AAG CTG 
Lys Leu 

GAG CGT 
Glu Arg 
160 



240 



336 



384 



432 
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EfllOfiS : 1 6 3 1 

h*bs?- -.mm. 

g£?"J©ggI : Genomic DNA 
£&: 

: Alcaligenes 
: sp. PC-1 

CTGCAGGCCG GGCCCGGTGG CCAATGGTCG CAACCGGCAG GACTGGAACA GGACGGCGGG 60 
GACGTCCGGC CCGGGCCACC GGTTACCAGC GTTGGCCGTC CTGACCTTGT CCTGCCGCCC 

CCGGTCTAGG CTGTCGCCCT ACGCAGCAGG AGTTTCGGAT GTCCGGACGG AAGCCTGGCA 120 
GGCCAGATCC GACAGCGGGA TGCGTCGTCC TCAAAGCCTA CAGGCCTGCC TTCGGACCGT 

CAACTGGCGA CACGATCGTC AATCTCGGTC TGACCGCCGC GATCCTGCTG TGCTGGCTGG 180 
GTTGACCGCT GTGCTAGCAG TTAGAGCCAG ACTGGCGGCG CTAGGACGAC ACGACCGACC 

TCCTGCACGC CTTTACGCTA TGGTTGCTAG ATGCGGCCGC GCATCCGCTG CTTGCCGTGC 240 
AGGACGTGCG GAAATGCGAT ACCAACGATC TACGCCGGCG CGTAGGCGAC GAACGGCACG 
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TGTGCCTGGC TGGGCTGACC TGGCTGTCGG TCGGGCTGTT CATCATCGCG CATGACGCAA 300 
ACACGGACCG ACCCGACTGG ACCGACAGCC AGCCCGACAA GTAGTAGCGC GTACTGCGTT 

TGCACGGGTC CGTGGTGCCG GGGCGGCCGC GCGCCAATGC GGCGATCGGG CAACTGGCGC 
ACGTGCCCAG GCACCACGGC CCCGCCGGCG CGCGGTTACG CCGCTAGCCC GTTGACCGCG 360 

TGTGGCTCTA TGCGGGGTTC TCGTGGCCCA AGCTGATCGC CAAGCACATG ACGCATCACC 420 
ACACCGAGAT ACGCCCCAAG AGCACCGGGT TCGACTAGCG GTTCGTGTAC TGCGTAGTGG 

GGCACGCCGG CACCGACAAC GATCCCGATT TCGGTCACGG AGGGCCCGTG CGCTGGTACG 480 
CCGTGCGGCC GTGGCTGTTG CTAGGGCTAA AGCCAGTGCC TCCCGGGCAC GCGACCATGC 

GCAGCTTCGT CTCCACCTAT TTCGGCTGGC GAGAGGGACT GCTGCTACCG GTGATCGTCA 540 
CGTCGAAGCA GAGGTGGATA AAGCCGACCG CTCTCCCTGA CGACGATGGC CACTAGCAGT 

CCACCTATGC GCTGATCCTG GGCGATCGCT GGATGTATGT CATCTTCTGG CCGGTCCCGG 600 
GGTGGATACG CGACTAGGAC CCGCTAGCGA CCTACATACA GTAGAAGACC GGCCAGGGCC 

CCGTTCTGGC GTCGATCCAG ATTTTCGTCT TCGGAACTTG GCTGCCCCAC CGCCCGGGAC 660 
GGCAAGACCG CAGCTAGGTC TAAAAGCAGA AGCCTTGAAC CGACGGGGTG GCGGGCCCTG 

ATGACGATTT TCCCGACCGG CACAACGCGA GGTCGACCGG CATCGGCGAC CCGTTGTCAC 720 
TACTGCTAAA AGGGCTGGCC GTGTTGCGCT CCAGCTGGCC GTAGCCGCTG GGCAACAGTG 
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TACTGACCTG CTTCCATTTC GGCGGCTATC ACCACGAACA TCACCTGCAT CCGCATGTGC 780 
ATGACTGGAC GAAGGTAAAG CCGCCGATAG TGGTGCTTGT AGTGGACGTA GGCGTACACG 

CGTGGTGGCG CCTGCCTCGT ACACGCAAGA CCGGAGGCCG CGCATGACGC AATTCCTCAT 840 
GCACCACCGC GGACGGAGCA TGTGCGTTCT GGCCTCCGGC GCGTACTGCG TTAAGGAGTA 

TGTCGTGGCG ACAGTCCTCG TGATGGAGCT GACCGCCTAT TCCGTCCACC GCTGGATTAT 900 
ACAGCACCGC TGTCAGGAGC ACTACCTCGA CTGGCGGATA AGGCAGGTGG CGACCTAATA 



GCACGGCCCC CTAGGCTGGG GCTGGCACAA GTCCCATCAC GAAGAGCACG ACCACGCGTT 960 
CGTGCCGGGG GATCCGACCC CGACCGTGTT CAGGGTAGTG CTTCTCGTGC TGGTGCGCAA 

GGAGAAGAAC GACCTCTACG GCGTCGTCTT CGCGGTGCTG GCGACGATCC TCTTCACCGT 1020 
CCTCTTCTTG CTGGAGATGC CGCAGCAGAA GCGCCACGAC CGCTGCTAGG AGAAGTGGCA 

GGGCGCCTAT TGGTGGCCGG TGCTGTGGTG GATCGCCCTG GGCATGACGG TCTATGGGTT 1080 
CCCGCGGATA ACCACCGGCC ACGACACCAC CTAGCGGGAC CCGTACTGCC AGATACCCAA 

GATCTATTTC ATCCTGCACG ACGGGCTTGT GCATCAACGC TGGCCGTTTC GGTATATTCC 1140 
CTAGATAAAG TAGGACGTGC TGCCCGAACA CGTAGTTGCG ACCGGCAAAG CCATATAAGG 

GCGGCGGGGC TATTTCCGCA GGCTCTACCA AGCTCATCGC CTGCACCACG CGGTCGAGGG 1200 
CGCCGCCCCG ATAAAGGCGT CCGAGATGGT TCGAGTAGCG GACGTGGTGC GCCAGCTCCC 
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GCGGGACCAC TGCGTCAGCT TCGGCTTCAT CTATGCCCCA CCCGTGGACA AGCTGAAGCA 1260 
CGCCCTGGTG ACGCAGTCGA AGCCGAAGTA GATACGGGGT GGGCACCTGT TCGACTTCGT 

GGATCTGAAG CGGTCGGGTG TCCTGCGCCC CCAGGACGAG CGTCCGTCGT GATCTCTGAT 1320 
CCTAGACTTC GCCAGCCCAC AGGACGCGGG GGTCCT.GCTC GCAGGCAGCA CTAGAGACTA 

CCCGGCGTGG CCGCATGAAA TCCGACGTGC TGCTGGCAGG GGCCGGCCTT GCCAACGGAC 1380 
GGGCCGCACC GGCGTACTTT AGGCTGCACG ACGACCGTCC CCGGCCGGAA CGGTTGCCTG 

TGATCGCGCT GGCGATCCGC AAGGCGCGGC CCGACCTTCG CGTGCTGCTG CTGGACCGTG 1440 
ACTAGCGCGA CCGCTAGGCG TTCCGCGCCG GGCTGGAAGC GCACGACGAC GACCTGGCAC 

CGGCGGGCGC CTCGGACGGG CATACTTGGT CCTGCCACGA CACCGATTTG GCGCCGCACT 1500 
GCCGCCCGCG GAGCCTGCCC GTATGAACCA GGACGGTGCT GTGGCTAAAC CGCGGCGTGA 

GGCTGGACCG CCTGAAGCCG ATCAGGCGTG GCGACTGGCC CGATCAGGAG GTGCGGTTCC 1560 
CCGACCTGGC GGACTTCGGC TAGTCCGCAC CGCTGACCGG GCTAGTCCTC CACGCCAAGG 

CAGACCATTC GCGAAGGCTC CGGGCCGGAT ATGGCTCGAT CGACGGGCGG GGGCTGATGC 1620 
GTCTGGTAAG CGCTTCCGAG GCCCGGCCTA TACCGAGCTA GCTGCCCGCC CCCGACTACG 

GTGCGGTGAC C 1631 
CACGCCACTG G 
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A 



237 246 255 264 273 2S2 

GTG CAT GCG CTG TGG TTT CTG GAC GCA GCG GCG CAT CCC ATC CTG GCG ATC GCA 
Met His Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro lie Leu Ala lie Ala 

291 300 309 318 327 336 

AAT TTC CTG GGG CTG ACC TGG CTG TCG GTC GGA TTG TTC ATC ATC GCG CAT GAC 
Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe lie lie Ala His Asp 

345 354 363 372 381 390 

GCG ATG CAC GGG TCG GTG GTG CCG GGG CGT CCG CGC GCC AAT GCG GCG ATG GGC 
Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Ala Asn Ala Ala Met Gly 

399 408 417 426 435 444 

CAG CTT GTC CTG TGG CTG TAT GCC GGA TTT TCG TGG CGC AAG ATG ATC GTC AAG 
Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe Ser Trp Arg Lys Met lie Val Lys 

453 462 471 480 489 498 

CAC ATG GCC CAT CAC CGC CAT GCC GGA ACC GAC GAC GAC CCC GAT TTC GAC CAT 
His Met Ala His His Arg His Ala Gly Thr Asp Asp Asp Pro Asp Phe Asp His 

507 516 525 534 543 552 

GGC GGC CCG GTC CGC TGG TAC GCC CGC TTC ATC GGC ACC TAT TTC GGC TGG CGC 
Gly Gly Pro Val Arg Trp Tyr Ala Arg Phe lie Gly Thr Tyr Phe Gly Trp Arg 

561 570 579 588 . 597 606 

GAG GGG CTG CTG CTG CCC GTC ATC GTG ACG GTC TAT GCG CTG ATC CTT GGG GAT 
Glu Gly Leu Leu Leu Pro Val lie Val Thr Val Tyr Ala Leu He Leu Gly Asp 

615 624 633 642 651 660 

CGC TGG ATG TAC GTG GTC TTC TGG CCG CTG CCG TCG ATC CTG GCG TCG ATC CAG 
Arg Trp Met Tyr Val Val Phe Trp Pro Leu Pro Ser He Leu Ala Ser He Gin 

669 678 687 696 705 714 

CTG TTC GTG TTC GGC ACC TGG CTG CCG CAC CGC CCC GGC CAC GAC GCG TTC CCG 
Leu Phe Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 

723 732 741 750 759 768 

GAC CGC CAC AAT GCG CGG TCG TCG CGG ATC AGC GAC CCC GTG TCG CTG CTG ACC 
Asp Arg His Asn Ala Arg Ser Ser Arg He Ser Asp Pro Val Ser Leu Leu Thr 

777 786 795 804 813 822 

TGC TTT CAC TTT GGC GGT TAT CAT CAC GAA CAC CAC CTG CAC CCG ACG GTG CCG 
Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His Pro Thr Val Pro 



831 840 
TGG TGG CGC CTG CCC AGC ACC 
Trp Trp Arg Leu Pro Ser Thr 



849 858 867 

CGC ACC AAG GGG GAC ACC GCA TGA 
Arg Thr Lys Gly Asp Thr Ala 



I 

B 
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f 872 881 890 899 908 917 

ATG ACC AAT TTC CTG ATC GTC GTC GCC ACC GTG CTG GTG ATG GAG TTG ACG GCC 
Met Thr Asn Phe Leu lie Val Val Ala Thr Val Leu Val Met Glu Leu Thr Ala 

926 935 944 953 962 971 

TAT TCC GTC CAC CGC TGG ATC ATG CAC GGC CCC CTG GGC TGG GGC TGG CAC AAG 
Tyr Ser Val His Arg Trp lie Met His Gly Pro Leu Gly Trp Gly Trp His Lys 

980 989 998 1007 1016 1025 

TCC CAC CAC GAG GAA CAC GAC CAC GCG CTG GAA AAG AAC GAC CTG TAC GGC CTG 
Ser His His Glu Glu His Asp His Ala Leu Glu Lys Asn Asp Leu Tyr Gly Leu 

1034 1043 1052 1061 1070 1079 

GTC TTT GCG GTG ATC GCC ACG GTG CTG TTC ACG GTG GGC TGG ATC TGG GCG CCG 
Val Phe Ala Val He Ala Thr Val Leu Phe Thr Val Gly Trp He Trp Ala Pro 

1088 1097 1106 1115 1124 1133 

GTC CTG TGG TGG ATC GCC TTG GGC ATG ACT GTC TAT GGG CTG ATC TAT TTC GTC 
Val Leu Trp Trp He Ala Leu Gly Met Thr Val Tyr Gly Leu He Tyr Phe Val 

1142 1151 1160 1169 1178 1187 

CTG CAT GAC GGG CTG GTG CAT CAG CGC TGG CCG TTC CGT TAT ATC CCG CGC AAG 
Leu His Asp Gly Leu Val His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Lys 

1196 1205 1214 1223 1232 1241 

GGC TAT GCC AGA CGC CTG TAT CAG GCC CAC CGC CTG CAC CAT GCG GTC GAG GGG 
Gly Tyr Ala Arg Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly 

1250 1259 1268 1277 1286 1295 

CGC GAC CAT TGC GTC AGC TTC GGC TTC ATC TAT GCG CCC CCG GTC GAC AAG CTG 
Arg Asp His Cys Val Ser Phe Gly Phe He Tyr Ala Pro Pro Val Asp Lys Leu 

1304 1313 1322 1331 1340 1349 

AAG CAG GAC CTG AAG ATG TCG GGC GTG CTG CGG GCC GAG GCG CAG GAG CGC ACG 
Lys Gin Asp Leu Lys Met Ser Gly Val Leu Arg Ala Glu Ala Gin Glu Arg Thr 

TGA 4" 



D 



FIG. 2 
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E 



1357 1366 1375 1384 1393 1402 

GTG ACC CAT GAC GTG CTG CTG GCA GGG GCG GGC CTT GCC AAC GGG CTG ATC GCC 
Met Thr His Asp Val Leu Leu Ala Gly Ala Gly Leu Ala Asn Gly Leu lie Ala 



1411 1420 1429 1438 1447 1456 

CTG GCG CTG CGC GCG GCG CGG CCC GAC CTG CGC GTG CTG CTG CTG GAC CAT GCC 
Leu Ala Leu Arg Ala Ala Arg Pro Asp Leu Arg Val Leu Leu Leu Asp His Ala 

1465 1474 1463 1492 1501 1510 

GCA GGA CCG TCA GAC GGC CAC ACC TGG TCC TGC CAC GAC CCC GAC CTG TCG CCG 
Ala Gly Pro Ser Asp Gly His Thr Trp Ser Cys His Asp Pro Asp Leu Ser Pro 

1519 1528 1537 1546 1555 1564 

GAC TGG CTG GCG CGG CTG AAG CCC CTG CGC CGC GCC AAC TGG CCC GAC CAG GAG 
Asp Trp Leu Ala Arg Leu Lys Pro Leu Arg Arg Ala Asn Trp Pro Asp Gin Glu 





1573 




1582 


1591 


1600 




1609 




1618 


GTG 


CGC TTT 


CCC 


CGC CAT 


GCC CGG CGG CTG 


GCC ACC 


GGT 


TAC GGG 


TCG 


CTG GAC 


Val 


Arg Phe 


Pro 


Arg His 


Ala Arg Arg Leu 


Ala Thr 


Gly 


Tyr Gly 


Ser 


Leu Asp 




1627 




1636 


1645 


1654 




1663 




1672 


GGG 


GCG GCG 


CTG 


GCG GAT 


GCG GTG GTC CGG 


TCG GGC 


GCC 


GAG ATC 


CGC 


TGG GAC 


Gly 


Ala Ala 


Leu 


Ala Asp 


Ala Val Val Arg 


Ser Gly 


Ala 


Glu He 


Arg 


Trp Asp 




1681 




1690 


1699 


1708 




1717 




1726 


AGC 


GAC ATC 


GCC 


CTG CTG 


GAT GCG CAG GGG 


GCG ACG 


CTG 


TCC TGC 


GGC 


ACC CGG 


Ser 


Asp He 


Ala 


Leu Leu 


Asp Ala Gin Gly 


Ala Thr 


Leu 


Ser Cys 


Gly 


Thr Arg 




1735 
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1897 1906 1915 1924 1933 1942 

CGC TTC ATC TAT CTG CTG CCC TTC TCT CCG ACG CGC ATC CTG ATC GAG GAC ACG 
Arg Phe He Tyr Leu Leu Pro Phe Ser Pro Thr Arg He Leu He Glu Asp Thr 

1951 1960 1969 1978 1987 1996 

CGC TAT TCC GAT GGC GGC GAT CTG GAC GAC GAC GCG CTG GCG GCG GCG TCC CAC 
Arg Tyr Ser Asp Gly Gly Asp Leu Asp Asp Asp Ala Leu Ala Ala Ala Ser His 
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2005 2014 2023 2032 2041 2050 

GAC TAT GCC CGC CAG CAG GGC TGG ACC GGG GCC GAG GTC CGG CGC GAA CGC GGC 
Asp Tyr Ala Arg Gin Gin Gly Trp Thr Gly Ala Glu val Arg Arg Glu Arg Gly 

2059 2068 2077 2086 2095 2104 

ATC CTT CCC ATC GCG CTG GCC CAT GAT GCG GCG GGC TTC TGG GCC GAT CAC GCG 
lie Leu Pro lie Ala Leu Ala His Asp Ala Ala Gly Phe Trp Ala Asp His Ala 

2113 2122 2131 2140 2149 2158 

GCG GGG CCT GTT CCC GTG GGA CTG CGC GCG GGG TTC TTT CAT CCG GTC ACC GGC 
Ala Gly Pro Val Pro Val Gly Leu Arg Ala Gly Phe Phe His Pro Val Thr Gly 

2167 2176 2185 2194 2203 2212 

TAT TCG CTG CCC TAT GCG GCA CAG GTG GCG GAC GTG GTG GCG GGT CTG TCC GGG 
Tyr Ser Leu Pro Tyr Ala Ala Gin Val Ala Asp Val Val Ala Gly Leu Ser Gly 

2221 2230 2239 2248 2257 2266 

CCG CCC GGC ACC GAC GCG CTG CGC GGC GCC ATC CGC GAT TAC GCG ATC GAC CGG 
Pro Pro Gly Thr Asp Ala Leu Arg Gly Ala lie Arg Asp Tyr Ala lie Asp Arg 

2275 2284 2293 2302 2311 2320 

GCG CGC CGC GAC CGC TTT CTG CGC CTTTTG AAC CGG ATG CTG TTC ' CGC GGC TGC 
Ala Arg Arg Asp Arg Phe Leu Arg Leu Leu Asn Arg Met Leu Phe Arg Gly Cys 

2329 2338 2347 2356 2365 2374 

GCG CCC GAC CGG CGC TAT ACC CTG CTG CAG CGG TTC TAC CGC ATG CCG CAT GGA 
Ala Pro Asp Arg Arg Tyr Thr Leu Leu Gin Arg Phe Tyr Arg Met Pro His Gly 

2383 2392 2401 2410 2419 2428 

CTG ATC GAA CGG TTC TAT GCC GGC CGG CTG AGC GTG GCG GAT CAG CTG CGC ATC 
Leu lie .Glu Arg Phe Tyr Ala Gly Arg Leu Ser Val Ala Asp Gin Leu Arg He 

2437 2446 2455 2464 2473 2482 

GTG ACC GGC AAG CCT CCC ATT CCC CTT GGC ACG GCC ATC CGC TGC CTG CCC GAA, 
Val Thr Gly Lys Pro Pro He Pro Leu Gly Thr Ala He Arg Cys Leu Pro Glu 



2491 2500 2509 

CGT CCC CTG CTG AAG GAA AAC GCA TGA 
Arg Pro Leu Leu Lys Glu Asn Ala 
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10 20 30 40 50 60 

* * * * * * 

GGATC CGGCG ACCTT GCGGC GCTGC GCCGC GCGCC TTTGC TGGTG CCTGG GCCGG GTGGC 
CCTAG GCCGC TGGAA CGCCG CGACG CGGCG CGCGG AAACG ACCAC GGACC CGGCC CACCG 

70 80 90 100 1X0 120 

* * * * * * 

CAATG GTCGC AAGCA ACGGG GATGG AAACC GGCGA TGCGG GACTG TAGTC TGCGC GGATC 
GTTAC CAGCG TTCGT TGCCC CTACC TTTGG CCGCT ACGCC CTGAC ATCAG ACGCG CCTAG 

130 140 150 160 170 180 

* * * * * * 

GCCGG TCCGG GGGAC AAGAT GAGCG CACAT GCCCT GCCCA AGGCA GATCT GACCG CCACC 
CGGCC AGGCC CCCTG TTCTA CTCGC GTGTA CGGGA CGGGT TCCGT CTAGA CTGGC GGTGG 

190 200 210 220 A 2 30 240 

» * * * w* * 

AGCCT GATCG TCTCG GGCGG CATCA TCGCC GCTTG GCTGG CCCTG CATGT GCATG CGCTG 
TCGGA CTAGC AGAGC CCGCC GTAGT AGCGG CGAAC CGACC GGGAC GTACA CGTAC GCGAC 

250 260 270 280 290 300 

* * * * * * 

TGGTT TCTGG ACGCA GCGGC GCATC CCATC CTGGC GATCG CAAAT TTCCT GGGGC TGACC 
ACCAA AGACC TGCGT CGCCG CGTAG GGTAG GACCG CTAGC GTTTA AAGGA CCCCG ACTGG 

310 320 330 340 350 360 

w * * * * * * 

TGGCT GTCGG TCGGA TTGTT CATCA TCGCG CATGA CGCGA TGCAC GGGTC GGTGG TGCCG 
ACCGA CAGCC AGCCT AACAA GTAGT AGCGC GTACT GCGCT ACGTG CCCAG CCACC ACGGC 

370 380 390 400 410 420 

****** 

GGGCG TCCGC GCGCC AATGC GGCGA TGGGC CAGCT TGTCC TGTGG CTGTA TGCCG GATTT 
CCCGC AGGCG CGCGG TTACG CCGCT ACCCG GTCGA ACAGG ACACC GACAT ACGGC CTAAA 

430 440 450 460 470 480 

****** 

TCGTG GCGCA AGATG ATCGT CAAGC ACATG GCCCA TCACC GCCAT GCCGG AACCG ACGAC 
AGCAC CGCGT TCTAC TAGCA GTTCG TGTAC CGGGT AGTGG CGGTA CGGCC TTGGC TGCTG 

490 500 510 520 530 540 

* * * • * * 

GACCC CGATT TCGAC CATGG CGGCC CGGTC CGCTG GTACG CCCGC TTCAT CGGCA CCTAT 
CTGGG GCTAA AGCTG GTACC GCCGG GCCAG GCGAC CATGC GGGCG AAGTA GCCGT GGATA 

550 560 570 580 590 600 

* * * . * * * 

TTCGG CTGGC GCGAG GGGCT GCTGC TGCCC GTCAT CGTGA CGGTC TATGC GCTGA TCCTT 
AAGCC GACCG CGCTC CCCGA CGACG ACGGG CAGTA GCACT GCCAG ATACG CGACT AGGAA 

610 620 630 640 650 660 

* * * * » * 

GGGGA TCGCT GGATG TACGT GGTCT TCTGG CCGCT GCCGT CGATC CTGGC GTCGA TCCAG 
CCCCT AGCGA CCTAC ATGCA CCAGA AGACC GGCGA CGGCA GCTAG GACCG CAGCT AGGTC 
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670 680 690 700 710 720 

****** 

CTGTT CGTGT TCGGC ACCTG GCTGC CGCAC CGCCC CGGCC ACGAC GCGTT CCCGG ACCGC 
GACAA GCACA AGCCG TGGAC CGACG GCGTG GCGGG GCCGG TGCTG CGCAA GGGCC TGGCG 



730 740 750 760 770 780 

* * * * * • 

CACAA TGCGC GGTCG TCGCG GATCA GCGAC CCCGT GTCGC TGCTG ACCTG CTTTC ACTTT 
GTGTT ACGCG CCAGC AGCGC CTAGT CGCTG GGGCA CAGCG ACGAC TGGAC GAAAG TGAAA 

790 800 810 820 830 840 

* * * * * * 

GGCGG TTATC ATCAC GAACA CCACC TGCAC CCGAC GGTGC CGTGG TGGCG CCTGC CCAGC 
CCGCC AATAG TAGTG CTTGT GGTGG ACGTG GGCTG CCACG GCACC ACCGC GGACG GGTCG 

850 860 C, 870 880 890 900 

ACCCG CACCA AGGGG GACAC CGCAT GACCA ATTTC CTGAT CGTCG TCGCC ACCGT GCTGG 
TGGGC GTGGT TCCCC CTGTG CTGGT TAAAG GACTA GCAGC AGCGG TGGCA CGACC 

910 920 g 930 940 950 960 

* * * * * * 

TGATG GAGTT GACGG CCTAT TCCGT CCACC GCTGG ATCAT GCACG GCCCC CTGGG CTGGG 
ACTAC CTCAA CTGCC GGATA AGGCA GGTGG CGACC TAGTA CGTGC CGGGG GACCC GACCC 

970 . 980 990 1000 1010 1020 

* * * * * * 

GCTGG CACAA GTCCC ACCAC GAGGA ACACG ACCAC GCGCT GGAAA AGAAC GACCT GTACG 
CGACC GTGTT CAGGG TGGTG ,CTCCT TGTGC TGGTG CGCGA CCTTT TCTTG CTGGA CATGC 

1030 1040 1050 1060 1070 1080 

* * * * * * 

GCCTG GTCTT TGCGG TGATC GCCAC GGTGC TGTTC ACGGT GGGCT GGATC TGGGC GCCGG 
CGGAC CAGAA ACGCC ACTAG CGGTG CCACG ACAAG TGCCA CCCGA CCTAG ACCCG CGGCC 

1090 1100 1110 1120 1130 1140 

****** 

TCCTG TGGTG GATCG CCTTG GGCAT GACTG TCTAT GGGCT GATCT "ATTTC GTCCT GCATG 

AGGAC ACCAC CTAGC GGAAC CCGTA CTGAC AGATA CCCGA CTAGA TAAAG CAGGA CGTAC 

) 

1150 1160 1170 1180 1190 1200 

* * * * * * 

ACGGG CTGGT GCATC AGCGC TGGCC GTTCC GTTAT ATCCC GCGCA AGGGC TATGC CAGAC 
TGCCC GACCA CGTAG TCGCG ACCGG CAAGG CAATA TAGGG CGCGT TCCCG ATACG GTCTG 

1210 1220 1230 1240 1250 1260 

****** 

GCCTG TATCA GGCCC ACCGC CTGCA CCATG CGGTC GAGGG GCGCG ACCAT TGCGT CAGCT 
CGGAC ATAGT CCGGG TGGCG GACGT GGTAC GCCAG CTCCC CGCGC TGGTA ACGCA GTCGA 

1270 1280 1290 1300 1310 1320 

****•• 

TCGGC TTCAT CTATG CGCCC CCGGT CGACA AGCTG AAGCA GGACC TGAAG ATGTC GGGCG 
AGCCG AAGTA GATAC GCGGG GGCCA GCTGT TCGAC TTCGT CCTGG ACTTC TACAG CCCGC 
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1330 1340 E 1350 1360 1370 1380 

y* 

TGCTG CGGGC CGAGG CGCAG GAGCG CACGT GACCC ATGAC GTGCT GCTGG CAGGG GCGGG 
ACGAC GCCCG GCTCC GCGTC CTCGC GTGCA CTGGG TACTG CACGA CGACC GTCCC CGCCC 

1390 1400 14ioD 1420 1430 1440 

«*•*** 
CCTTG CCAAC GGGCT GATCG CCCTG GCGCT GCGCG CGGCG CGGCC CGACC TGCGC GTGCT 
GGAAC GGTTG CCCGA CTAGC GGGAC CGCGA CGCGC GCCGC GCCGG GCTGG ACGCG CACGA 

1450 1460 1470 1480 1490 1500 

****** 
GCTGC TGGAC CATGC CGCAG GACCG TCAGA CGGCC ACACC TGGTC CTGCC ACGAC CCCGA 
CGACG ACCTG GTACG GCGTC CTGGC AGTCT GCCGG TGTGG ACCAG GACGG TGCTG GGGCT 

1510 1520 1530 1540 1550 1560 

****** 
CCTGT CGCCG GACTG GCTGG CGCGG CTGAA GCCCC TGCGC CGCGC CAACT GGCCC GACCA 
GGACA GCGGC CTGAC CGACC GCGCC GACTT CGGGG ACGCG GCGCG GTTGA CCGGG CTGGT 

1570 1580 1590 1600 1610 1620 

* * * * * * 

GGAGG TGCGC TTTCC CCGCC ATGCC CGGCG GCTGG CCACC GGTTA CGGGT CGCTG GACGG 

CCTCC ACGCG AAAGG GGCGG TACGG GCCGC CGACC GGTGG CCAAT GCCCA GCGAC CTGCC 

1630 1640 1650 1660 1670 1680 

***** * 
GGCGG CGCTG GCGGA TGCGG TGGTC CGGTC GGGCG CCGAG ATCCG CTGGG ACAGC GACAT 
CCGCC GCGAC CGCCT ACGCC ACCAG GCCAG CCCGC GGCTC TAGGC GACCC TGTCG CTGTA 

1690 1700 1710 1720 1730 1740 

***** * 
CGCCC TGCTG GATGC GCAGG GGGCG ACGCT GTCCT GCGGC ACCCG GATCG AGGCG GGCGC 
GCGGG ACGAC CTACG CGTCC CCCGC TGCGA CAGGA CGCCG TGGGC CTAGC TCCGC CCGCG 

1750 1760 1770 1780 1790 1800 

****** 
GGTCC TGGAC GGGCG GGGCG CGCAG CCGTC GCGGC ATCTG ACCGT GGGTT TCCAG AAATT • 
CCAGG ACCTG CCCGC CCCGC GCGTC GGCAG CGCCG TAGAC TGGCA CCCAA AGGTC TTTAA 

1810 1820 1830 1840 1850 1860 

**••** 
CGTGG GTGTC GAGAT CGAGA CCGAC CGCCC CCACG GCGTG CCCCG CCCGA TGATC ATGGA 
GCACC CACAG CTCTA GCTCT GGCTG GCGGG GGTGC CGCAC GGGGC GGGCT ACTAG TACCT 

1870 1880 1890 1900 1910 1920 

* * « * * * 

CGCGA CCGTC ACCCA GCAGG ACGGG TACCG CTTCA TCTAT CTGCT GCCCT TCTCT CCGAC 
GCGCT GGCAG TGGGT CGTCC TGCCC ATGGC GAAGT AGATA GACGA CGGGA AGAGA GGCTG 

1930 1940 1950 1960 1970 1980 

* * • * * * 
GCGCA TCCTG ATCGA GGACA CGCGC TATTC CGATG GCGGC GATCT GGACG ACGAC GCGCT 
CGCGT AGGAC TAGCT CCTGT GCGCG ATAAG GCTAC CGCCG CTAGA CCTGC TGCTG CGCGA 
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1990 2000 2010 2020 2030 2040 

***•«* 

GGCGG CGGCG TCCCA CGACT ATGCC CGCCA GCAGG GCTGG ACCGG GGCCG AGGTC CGGCG 
CCGCC GCCGC AGGGT GCTGA TACGG GCGGT CGTCC CGACC TGGCC CCGGC TCCAG GCCGC 

2050 2060 2070 2080 2090 2100 

****** 

CGAAC GCGGC ATCCT TCCCA TCGCG CTGGC CCATG ATGCG GCGGG CTTCT GGGCC GATCA 
GCTTG CGCCG TAGGA AGGGT AGCGC GACCG GGTAC TACGC CGCCC GAAGA CCCGG CTAGT 

2110 2120 2130 2140 2150 2160 

****** 

CGGGG CCTGT TCCCG TGGGA CTGCG CGCGG GGTTC TTTCA TCCGG TCACC GGCTA 
GCCCC GGACA AGGGC ACCCT GACGC GCGCC CCAAG AAAGT AGGCC AGTGG CCGAT 

2170 2180 2190 2200 2210 2220 

****** 

TTCGC TGCCC TATGC GGCAC AGGTG GCGGA CGTGG TGGCG GGTCT GTCCG GGCCG CCCGG 
AAGCG ACGGG ATACG CCGTG TCCAC CGCCT GCACC ACCGC CCAGA CAGGC CCGGC GGGCC 



2230 2240 2250 2260 2270 2280 

****** 

CACCG ACGCG CTGCG CGGCG CCATC CGCGA TTACG CGATC GACCG GGCGC GCCGC GACCG 
GTGGC TGCGC GACGC GCCGC GGTAG GCGCT AATGC GCTAG CTGGC CCGCG CGGCG CTGGC 

2290 2300 2310 2320 2330 2340 

****** 

CTTTC TGCGC CTTTT GAACC GGATG CTGTT CCGCG GCTGC GCGCC CGACC GGCGC TATAC 
GAAAG ACGCG GAAAA CTTGG CCTAC GACAA GGCGC CGACG CGCGG GCTGG CCGCG ATATG 

2350 2360 2370 2380 2390 2400 

****** 

CCTGC TGCAG CGGTT CTACC GCATG CCGCA TGGAC TGATC GAACG GTTCT ATGCC GGCCG 
GGACG ACGTC GCCAA GATGG CGTAC GGCGT ACCTG ACTAG CTTGC CAAGA TACGG CCGGC 

2410 2420 2430 2440 2450 2460 

****** 

GCTGA GCGTG GCGGA TCAGC TGCGC ATCGT GACCG GCAAG CCTCC CATTC CCCTT GGCAC 
CGACT CGCAC CGCCT AGTCG ACGCG TAGCA CTGGC CGTTC GGAGG GTAAG GGGAA CCGTG 



2470 2480 2490 

* * * 

GGCCA TCCGC TGCCT GCCCG AACGT CCCCT 
CCGGT AGGCG ACGGA CGGGC TTCCA GGGGA 

2530 2540 2550 

* * * 

CCCGC GGCCA AGACC GCCAT CGTGA TCGGC 
GGGCG CCGGT TCTGG CGGTA GCACT AGCCG 



2500 2510 2S20 

* * * 

GCTGA AGGAA AACGC ATGAA CGCCC ATTCG 
CGACT TCCTT TTGCG TACTT GCGGG TAAGC 

2S60 tT2570 2580 

* ' * * 

GCAGG CTTTG GCGGG CTGGC CCTGG CCATC 
CGTCC GAAAC CGCCC GACCG GGACC GGTAG 



2590 2600 2610 2620 2630 2640 

■ * • * * * 

CGCCT GCAGT CCGCG GGCAT CGCCA CCACC CTGGT CGAGG CCCGG GACAA GCCCG GCGGG 
GCGGA CGTCA GGCGC CCGTA GCGGT GGTGG GACCA GCTCC GGGCC CTGTT CGGGC CGCCC 



FIG. 8 

8/20 



WO 95/18220 



PCT/JP94/02220 



2650 2660 2670 2680 2690 2?00 

****** 
CGCGC CTATG TCTGG CACGA TCAGG GCCAT CTCTT CGACG CGGGC CCGAC CGTCA TCACC 
GCGCG GATAC AGACC GTGCT AGTCC CGGTA GAGAA GCTGC GCCCG GGCTG GCAGT AGTGG 

2710 2720 2730 2740 2750 2760 

****** 
GACCC CGATG CGCTG AAAGA GCTGT GGGCC CTGAC CGGGC AGGAC ATGGC GCGCG ACGTG 
CTGGG GCTAC GCGAC TTTCT- CGACA CCCGG GACTG GCCCG TCCTG TACCG CGCGC TGCAC 

2770 2780 2790 2800 2810 2820 

****** 
ACGCT GATGC CGGTC TCGCC CTTCT ATCGG CTGAT GTGGC CGGGC GGGAA GGTCT TCGAT 
TGCGA CTACG GCCAG AGCGG GAAGA TAGCC GACTA CACCG GCCCG CCCTT CCAGA AGCTA 

2830 2840 2850 2860 2870 2880 

* * * * ... * * 

TACGT GAACG AGGCC GATCC AGGGT CTGGG TCTTG CCGTG CCAGG TGAAG CTGTT GCCGT 
ATGCA CTTGC TCCGG CTAGG TCCCA GACCC AGAAC GGCAC GGTCC ACTTC GACAA CGGCA 

2886 
* 

GGATC C 
CCTAG G 
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I 110 120 130 140 150 

ATGTCCGGACGGAAGCCTGGCACAACTGGCGACACGATCGTCAATCTCGGTCTGACCGCC 
1 MetSerGlyArgLysProGlyThrThrGlyAspThrlleValAsnLeuGlyLeuThrAla 

160. 170 180 190 200 210 

GCGATCCTGCTGTGCTGGCTGGTCCTGCACGCCTTTACGCTATGGTTGCTAGATGCGGCC 
21 AlalleLeuLeuCysTrpLeuValLeuHiaAlaPheThrLeuTrpLeuLeuAspAlaAla 

220 230 240 250 260 270 

GCGCATCCGCTGCTTGCCGTGCTGTGCCTGGCTGGGCTGACCTGGCTGTCGGTCGGGCTG 
41 AlaHisProLeuLeuAlaValLeuCysLeuAlaGlyLeuThrtrpLeuSerValGlyLeu 

280 290 300 310 320 330 

TTCATCATCGCGCATGACGCAATGCACGGGTCCGTGGTGCCGGGGCGGCCGCGCGCCAAT 
61 PhellelleAlaHiaAspAlaMetHiaGlySerValValProGlyArgProArgAlaAsn 

340 350 360 370 380 390 

GCGGCGATCGGGCAACTGGCGCTGTGGCTCTATGCGGGGTTCTCGTGGCCCAAGCTGATC 
81 AlaAlalleGlyGlnLeuAlaLeuTrpLeuTyrAlaGlyPheSerTrpProLysLeuIle 

400 410 420 430 440 450 

GCCAAGCACATGACGCATCACCGGCACGCCGGCACCGACAACGATCCCGATTTCGGTCAC 
101 AlaLysHisMetThrHisHisArgHisAlaGlyThrAspAsnAspProAspPheGlyHis 

460 470 480 490 500 510 

GGAGGGCCCGTGCGCTGGTACGGCAGCTTCGTCTCCACCTATTTCGGCTGGCGAGAGGGA 
121 GlyGlyProValArgTrpTyrGlySerPheValSerThrTyrPheGlyTrpArgGluGly 

520 530 540 550 560 570 

CTGCTGCTACCGGTGATCGTCACCACCTATGCGCTGATCCTGGGCGATCGCTGGATGTAT 
141 ^euLeuLeuProVallleValThrThrTyrAlaLeuIleLeuGlyAspArgTrpMetTyr 

580 590 600 610 620 630 

GTCATCTTCTGGCCGGTCCCGGCCGTTCTGGCGTCGATCCAGATTTTCGTCTTCGGAACT 
161 ValllePheTrpProValProAlaValLeuAlaSerlleGlnllePheValPheGlyThr 

640 650 660 670 680 690 

TGGCTGCCCCACCGCCCGGGACATGACGATTTTCCCGACCGGCACAACGCGAGGTCGACC 
181 TrpLeuProHisArgProGlyHisAspAspPheProAspArgHisAsnAlaArgSerThr 

700 710 720 730 740 750 

GGCATCGGCGACCCGTTGTCACTACTGACCTGCTTCCATTTCGGCGGCTATCACCACGAA 
201 GlylleGlyAspProLeuSerLeuLeuThrCysPheHisPheGlyGlyTyrHisHisGlu 



FIG. 13 



13/20 



WO 95/18220 



PCT/JP94/02220 



760 770 780 790 800 810 

CATCACCTGCATCCGCATGTGCCGTGGTGGCGCCTGCCTCGTACACGCAAGACCGGAGGC 
221 HisHisLeuHisProHisValProTrpTrpArgLeuProArgThrArgLysThrGlyGly 

820 827 
CGCGCATGA 
241 ArgAla*** 



te 
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w * 830 840 850 860 870 880 

ATGACGCAATTCCTCATTGTCGTGGCGACAGTCCTCGTGATGGAGCTGACCGCCTATTCC 
1 MetThrGlnPheLeuIleValValAlaThrValLeuValMetGluLeuThrAlaTyrSer 

890 900 910 920 930 940 

GTCCACCGCTGGATTATGCACGGCCCCCTAGGCTGGGGCTGGCACAAGTCCCATCACGAA 
21 ValHisArgTrpIleMetHisGlyProLeuGlyTrpGlyTrpHisLysSerHisHisGlu 

950 960 970 980 990 1000 

GAGCACGACCACGCGTTGGAGAAGAACGACCTCTACGGCGTCGTCTTCGCGGTGCTGGCG 
41 GluHisAspHisAlaLeuGluLysAsnAspLeuTyrGlyValValPheAlaValLeuAla 

1010 1020 1030 1040 1050 1060 

ACGATCCTCTTCACCGTGGGCGCCTATTGGTGGCCGGTGCTGTGGTGGATCGCCCTGGGC 
61 ThrlleLeuPheThrValGlyAlaTyrTrpTrpProValLeuTrpTrpIleAlaLeuGly 

1070 1080 1090 1100 1110 1120 

ATGACGGTCTATGGGTTGATCTATTTCATCCTGCACGACGGGCTTGTGCATCAACGCTGG 
81 MetThrValTyrGlyLeuIleTyrPhelleLeuHisAspGlyLeuValHisGlnArgTrp 

1130 1140 1150 1160 1170 1180 

CCGTTTCGGTATATTCCGCGGCGGGGCTATTTCCGCAGGCTCTACCAAGCTCATCGCCTG 
101 ProPheArgTyrlleProArgArgGlyTyrPheArgArgLeuTyrGlnAlaHisArgLeu 

1190 1200 1210 1220 1230 1240 

CACCACGCGGTCGAGGGGCGGGACCACTGCGTCAGCTTCGGCTTCATCTATGCCCCACCC 
121 HisHisAlaValGluGlyArgAapHisCysValSerPheGlyPhelleTyrAlaProPro 

1250 1260 1270 1280 1290 1300 

GTGGACAAGCTGAAGCAGGATCTGAAGCGGTCGGGTGTCCTGCGCCCCCAGGACGAGCGT 
141 ValAspLysLeuLysGlnAspLeuLysArgSerGlyValLeuArgProGlnAspGluArg 

1312 
CCGTCGTGA 
161 ProSer*** 

Id 
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10 20 30 40 50 60 

CTGCA GGCCG GGCCC GGTGG CCAAT GGTCG CAACC GGCAG GACTG GAACA GGACG GCGGG 
GACGT CCGGC CCGGG CCACC GGTTA CCAGC GTTGG CCGTC CTGAC CTTGT CCTGC CGCCC 

Al 

70 80 90 f 110 120 

CCGGT CTAGG CTGTC GCCCT ACGCA GCAGG AGTTT CGGAT GTCCG GACGG AAGCC TGGCA 
GGCCA GATCC GACAG CGGGA TGCGT CGTCC TCAAA GCCTA CAGGC CTGCC TTCGG ACCGT 

130 140 .150 160 170 180 

CAACT GGCGA CACGA TCGTC AATCT CGGTC TGACC GCCGC GATCC TGCTG TGCTG GCTGG 
GTTGA CCGCT GTGCT AGCAG TTAGA GCCAG ACTGG CGGCG CTAGG ACGAC ACGAC CGACC 

190 200 210 220 230 240 

TCCTG CACGC CTTTA CGCTA TGGTT GCTAG ATGCG GCCGC GCATC CGCTG CTTGC CGTGC 
AGGAC GTGCG GAAAT GCGAT ACCAA CGATC TACGC CGGCG CGTAG GCGAC GAACG GCACG 

250 260 270 280 290 300 

TGTGC CTGGC TGGGC TGACC TGGCT GTCGG TCGGG CTGTT CATCA TCGCG CATGA CGCAA 
ACACG GACCG ACCCG ACTGG ACCGA CAGCC AGCCC GACAA GTAGT AGCGC GTACT GCGTT 

310 320 330 340 350 360 

TGCAC GGGTC CGTGG TGCCG GGGCG GCCGC GCGCC AATGC GGCGA TCGGG CAACT GGCGC 
ACGTG CCCAG GCACC ACGGC CCCGC CGGCG CGCGG TTACG CCGCT AGCCC GTTGA CCGCG 

370 380 390 400 410 420 

TGTGG CTCTA TGCGG GGTTC TCGTG GCCCA AGCTG ATCGC CAAGC ACATG ACGCA TCACC 
ACACC GAGAT ACGCC CCAAG AGCAC CGGGT TCGAC TAGCG GTTCG TGTAC TGCGT AGTGG 

430 440 450 460 470 480 

GGCAC GCCGG CACCG ACAAC GATCC CGATT TCGGT CACGG AGGGC CCGTG CGCTG GTACG 
CCGTG CGGCC GTGGC TGTTG CTAGG GCTAA AGCCA GTGCC TCCCG GGCAC GCGAC CATGC 

490 500 510 520 530 540 

GCAGC TTCGT CTCCA CCTAT TTCGG CTGGC GAGAG GGACT GCTGC TACCG GTGAT CGTCA 
CGTCG AAGCA GAGGT GGATA AAGCC GACCG CTCTC CCTGA CGACG ATGGC CACTA GCAGT 

550 560 570 580 590 600 

CCACC TATGC GCTGA TCCTG GGCGA TCGCT GGATG TATGT CATCT TCTGG CCGGT CCCGG 
GGTGG ATACG CGACT AGGAC CCGCT AGCGA CCTAC ATACA GTAGA AGACC GGCCA GGGCC 

610 620. 630 640 650 660 

CCGTT CTGGC GTCGA TCCAG ATTTT CGTCT TCGGA ACTTG GCTGC CCCAC CGCCC GGGAC 
GGCAA GACCG CAGCT AGGTC TAAAA GCAGA AGCCT TGAAC CGACG GGGTG GCGGG CCCTG 

670 680 690 700 710 720 

ATGAC GATTT TCCCG ACCGG CACAA CGCGA GGTCG ACCGG CATCG GCGAC CCGTT GTCAC 
TACTG CTAAA AGGGC TGGCC GTGTT GCGCT CCAGC TGGCC GTAGC CGCTG GGCAA CAGTG 



FIG. 16 

16/20 



WO 95/18220 



PCT7JP94/02220 



730 740 750 760 770 780 

TACTG ACCTG CTTCC ATTTC GGC6G CTATC ACCAC GAACA TCACC TGCAT CCGCA TGTGC 
ATGAC TGGAC GAAGG TAAAG CCGCC GATAG TGGTG CTTGT AGTGG ACGTA GGCGT ACACG 

790 800 810 820 ^1 830 840 

CGTGG TGGCG CCTGC CTCGT ACACG CAAGA CCGGA GGCCG CGCAT GACGC AATTC CTCAT 
GCACC ACCGC GGACG GAGCA TGTGC GTTCT GGCCT CCGGC GCGTA CTGCG TTAAG GAGTA 



850 860 870 880 \q 890 900 

TGTCG TGGCG ACAGT CCTCG TGATG GAGCT GACCG CCTAT TCCGT CCACC GCTGG ATTAT 
ACAGC ACCGC TGTCA GGAGC ACTAC CTCGA CTGGC GGATA AGGCA GGTGG CGACC TAATA 

910 920 930 940 950 960 

GCACG GCCCC CTAGG CTGGG GCTGG CACAA GTCCC ATCAC GAAGA GCACG ACCAC GCGTT 
CGTGC CGGGG GATCC GACCC CGACC GTGTT CAGGG TAGTG CTTCT CGTGC TGGTG CGCAA 

970 980 990 1000 1010 1020 

GGAGA AGAAC GACCT CTACG GCGTC GTCTT CGCGG TGCTG GCGAC GATCC TCTTC ACCGT 
CCTCT TCTTG CTGGA GATGC CGCAG CAGAA GCGCC ACGAC CGCTG CTAGG AGAAG TGGCA 

1030 1040 1050 1060 1070 1080 

GGGCG CCTAT TGGTG GCCGG TGCTG TGGTG GATCG CCCTG GGCAT GACGG TCTAT GGGTT 
CCCGC GGATA ACCAC CGGCC ACGAC ACCAC CTAGC GGGAC CCGTA CTGCC AGATA CCCAA 

1090 1100 1110 1120 1130 1140 

GATCT ATTTC ATCCT GCACG ACGGG CTTGT GCATC AACGC TGGCC GTTTC GGTAT ATTCC 
CTAGA TAAAG TAGGA CGTGC TGCCC GAACA CGTAG TTGCG ACCGG CAAAG CCATA TAAGG 

1150 1160 1170 1180 * 1190 1200 

GCGGC GGGGC TATTT CCGCA GGCTC TACCA AGCTC ATCGC CTGCA CCACG CGGTC GAGGG 
CGCCG CCCCG ATAAA GGCGT CCGAG ATGGT TCGAG TAGCG GACGT GGTGC GCCAG CTCCC 

1210 1220 1230 1240 1250 1260 

GCGGG ACCAC TGCGT CAGCT TCGGC TTCAT CTATG CCCCA CCCGT GGACA AGCTG AAGCA 
CGCCC TGGTG ACGCA GTCGA AGCCG AAGTA GAT AC GGGGT GGGCA CCTGT TCGAC TTCGT 

1270 12B0 1290 1300 1310 1320 

GGATC TGAAG CGGTC GGGTG TCCTG CGCCC CCAGG ACGAG CGTCC GTCGT GATCT CTGAT 
CCTAG ACTTC GCCAG CCCAC AGGAC GCGGG GGTCC TGCTC GCAGG CAGCA CTAGA GACTA 



V 



1330 1340 1350 1360 137b 1380 

CCCGG CGTGG CCGCA TGAAA TCCGA CGTGC TGCTG GCAGG GGCCG GCCTT GCCAA CGGAC 
GGGCC GCACC GGCGT ACTTT AGGCT GCACG ACGAC CGTCC CCGGC CGGAA CGGTT GCCTG 

1390 1400 1410 1420 1430 1440 

TGATC GCGCT GGCGA TCCGC AAGGC GCGGC CCGAC CTTCG CGTGC TGCTG CTGGA CCGTG 
ACTAG CGCGA CCGCT AGGCG TTCCG CGCCG GGCTG GAAGC GCACG ACGAC GACCT GGCAC 
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1450 1460 1470 1480 1490 1500 

CGGCG GGCGC CTCGG ACGGG CATAC TTGGT CCTGC CACGA CACCG ATTTG GCGCC GCACT 
GCCGC CCGCG GAGCC TGCCC GTATG AACCA GGACG GTGCT GTGGC TAAAC CGCGG CGTGA 



1510 1520 1530 

GGCTG GACCG CCTGA AGCCG ATCAG GCGTG 
CCGAC CTGGC GGACT TCGGC TAGTC CGCAC 

1570 1580 1590 

CAGAC CATTC GCGAA GGCTC CGGGC CGGAT 
GTCTG GTAAG CGCTT CCGAG GCCCG GCCTA 

1631 

GTGCG GTGAC C 
CACGC CACTG G 



1540 1550 1560 

GCGAC TGGCC CGATC AGGAG GTGCG GTTCC 
CGCTG ACCGG GCTAG TCCTC CACGC CAAGG 

1600 1610 1620 

ATGGC TCGAT CGACG GGCGG GGGCT GATGC 
TACCG AGCTA GCTGC CCGCC CCCGA CTACG 
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